2CTS OF
INDUCTION OI 


HERBERT rem, Je, 


The relationship between Gibson's negative aftereffect and the
Köhler-Wallach figural aftereffect has been the subject of
controversy.

Gibson (1933) found that a curved line, when perceived for a period
of time, becomes phenomenally less curved than it was originally,
and that after such an inspection period an objectively straight
line appears curved in the opposite direction from the inspection
curve. These negative aftereffects were attributed to a process
similar to sensory adaptation, in which perceptions when long
continued tend to approach the neutral. In curvature, Gibson regards
the straight line as a norm from which other lines deviate. Thus a
straight line will serve as an anchoring point in the perception of
curved lines. A frequent condition of the environment tends to
become a norm of the phenomenal world, and new stimuli are perceived
in relation to it.


Köhler and Wallach (1944), in their studies of figural aftereffects,
postulated an electrical field process in the visual cortex which
satiates the cortex in the immediate area of the cortical
representation of the inspection figure.

This research was supported in part by 
the Research of the Graduate 
School of the University of Wisconsin with 
funds provided by the Wisconsin Alumni Re- 
search Foundation. Authors are indebted to 
J. J. Gibson for a critical and helpful reading 
of the manuscript 
AND
Wisconsin 
This satiation results in increased resistance to further
stimulation in this area and to displacement of the cortical
representation to neighboring regions upon subsequent stimulation.
Köhler and Wallach proposed that Gibson's curved line effect could
be adequately explained by their satiation theory, if the test line
were considered to be displaced from the satiation area of the
inspection curve.

Although Osgood and Heyer (1952) suggest an alternative mechanism
to explain figural aftereffects, both their theory and the
Köhler-Wallach theory are based upon the same sort of physiological
satiation by prolonged inspection of contours.

Several recent attempts have been made to differentiate
experimentally the Gibson negative aftereffect from the
Köhler-Wallach figural aftereffect. Sagara and Oyama (1957), in
their survey of studies of figural aftereffects in Japan, cited
considerable evidence to indicate that the curved line effect could
not be explained by the Köhler-Wallach theory.

Bergman and Gibson (1959), using a slanted textured surface as a
stimulus, demonstrated that one type of negative aftereffect could
not be explained on the basis of satiation and contour displacement.
Gibson (1959a) has argued on logical grounds that the two kinds of
aftereffect cannot possibly be the same, and has recently (Gibson,
1959b, pp. 489-491) restated the normalization hypothesis.


Such evidence as that reported by Sagara and Oyama (1957) and
Bergman and Gibson (1959) suggests that different processes are
involved in the two types of aftereffect. Further investigation
seemed warranted to elucidate the nature of these differences and
to clarify the theoretical bases of the phenomena.

The Köhler-Wallach theory suggests a gradual dissipation with time
of the differential satiation in the visual cortex. If the
dissipation is spontaneous, and if satiation explains both
contour-displacement and curvature-straightening, it could be
predicted that with homogeneous stimulation (e.g., a "ganzfeld")
interpolated between the viewing of the inspection figure and of the
test figure, both types of visual aftereffects should decrease. The
Gibson theory, on the other hand, suggests that the normalization of
a curved line would not dissipate during an afterperiod of
homogeneous stimulation since the aftereffect represents simply a
readaptation to the conditions of stimulation prevailing before the
experiment. If the effects of homogeneous stimulation on the two
types of visual aftereffects differ, the distinctiveness of the two
phenomena will have been demonstrated and the adequacy of either
theory to explain such differential results can then be evaluated.


METHOD 


In order to test the above predictions three different
postinspection visual experiences were introduced following the
induction of a Köhler-Wallach aftereffect and a Gibson curved line
aftereffect, and their influence upon the aftereffects was then
measured.

Subjects.—The Ss for the Köhler-Wallach figural aftereffect were 36
summer school students registered in introductory psychology
courses; Ss for the Gibson negative aftereffect were 30 summer
school students enrolled in introductory psychology or education
courses.


Apparatus.—The apparatus for the Köhler-Wallach effect was a
modified Dodge type tachistoscope in which the inspection (I) figure
was shown binocularly at a distance of 58 cm. from the eye. The test
(T) figures were exposed at an equal distance. In both cases the
visual field was 19.7 cm. square. The I figure consisted of a circle
5.5 cm. in diameter which was placed to the left of a fixation point
in the center of the visual field. The width of the line defining
the circumference of the circle was 2 mm. Each T figure was drawn on
white cardboard and consisted of two circles placed on opposite
sides and equally distant from a fixation point. The width of the
line in the T circles was .75 mm. In half the test trials the circle
on the left was 7.6 cm. in diameter and would tend to be expanded
outward to produce an aftereffect of increased size. These circles
were always concentric with the original I figure. When the left
circle was 7.6 cm. in diameter, the circle on the right was 7.6,
7.8, 8.0, 8.2, or 8.4 cm. in diameter. In the other half of the test
trials the circle on the left was 4.0 cm. in diameter and would tend
to be compressed inward to produce an aftereffect of decreased size.
When the circle on the left of the T figure was 4.0 cm. in diameter
the circle on the right was 3.6, 3.7, 3.8, 3.9, or 4.0 cm. in
diameter. In both cases the size of the right circle was varied
randomly from trial to trial. The use of two different size circles
on the left was necessitated by the possibility of response bias
occurring if the figural aftereffects always occurred in the same
direction. A measure of the magnitude of the figural aftereffect
could be determined by noting size judgments of circles on the right
in relation to circles on the left. The method of obtaining this
measure is discussed in the section on scoring of performance.

The Gibson visual aftereffect was produced by having Ss fixate an I
figure consisting of a black curved line 3 mm. in width and convex
to the left drawn on white cardboard. In order to eliminate straight
reference lines, the I line was presented through a circular
aperture 30 cm. from the eye. The diameter of the aperture was 25.5
cm. The curved line extended the full length of the aperture and was
bowed 2 cm. at its center. A series of five T lines was constructed
and presented in the same manner. The T lines, 3 mm. in thickness,
were also bowed to the left by 0.0, 0.1, 0.2, 0.3, and 0.5 cm. The T
lines were presented in a random order. The magnitude of the
aftereffect could be determined by noting which of these lines were
perceived as straight.


The brightness of the larger surfaces (walls, table-top) to which S
was exposed under conditions of normal stimulation varied from 4.0
to 4.4 ft-c as measured by a Macbeth illuminometer. The goggles for
the ganzfeld condition passed approximately 25% of the illumination.
The brightness of the I and T fields for the Köhler-Wallach figural
aftereffects was 2.7 ft-c. The brightness of the I and T field for
the Gibson aftereffects was 3.2 ft-c.

Procedure.—Hammer (1949), in the only 
published study of the dissipation of figural 
aftereffects, found that they decreased to 
zero in 150 sec. Pilot studies, using the 
conditions outlined above, indicated that with 
5-min. I periods and subsequent normal
visual conditions the Köhler-Wallach after-
effect diminished to zero in approximately
30 min.; the Gibson effect in 15 min. These 
time intervals were consequently used for the 
postinduction visual exposure periods in the 
respective experiments. Each experiment 
consisted of five steps in the following order: 
an initial control test with the T figures, a 
5-min. fixation of the I figure, an immediate 
test of the magnitude of the aftereffect, a 
postinspection period in one of three visual 
conditions, and a final test of the magnitude 
of the aftereffect. 

In the Gibson procedure the control test consisted in making
judgments of the direction of curvature of the five curved test
figures convex to the left plus two curved figures convex to the
right. The latter curved lines were included to accustom S to
perceive lines curved in both directions. These lines were presented
in random order. This procedure required approximately 20 sec.
depending on the speed of judgment of S. During the 5-min. I period
which followed the control test, S was instructed to run his eyes
slowly up and down the middle portion of the I line. The immediate
test of the magnitude of the aftereffect was the same as the control
test except for the omission of the curves convex to the right.
One-third of the Ss then received each of the following
postinspection conditions for 15 min.: Cond. I, normal stimulation,
S looked around the room; Cond. II, homogeneous light stimulation,
S's vision was limited to a homogeneous ganzfeld, produced by
goggles which covered each eye with a concave section of a
translucent ping pong ball; Cond. III, homogeneous lack of
stimulation, S had a black blindfold over open eyes. A final test of
the magnitude of the aftereffect was administered as before.

In the Köhler-Wallach procedure the initial control test consisted
of viewing the 10 test cards with two black circles on them. Cards
were presented in random order at 4-sec. intervals. The S was
instructed to fixate on the cross between the circles and report
which of the circles was larger or if they were equal in size.
During the 5-min. I period S was directed to fixate on the cross on
the I figure. The subsequent test utilized the same procedure and
stimuli as the control test. One-third of the Ss were then exposed
for 30 min. to each of the three postinspection conditions described
above and finally the third test of the magnitude of the figural
aftereffect was administered in the same manner as the previous
ones.

Scoring of performance.—The scoring procedure for the Köhler-Wallach
aftereffect was as follows: An arbitrary scoring system assigned
numerical values to the size of the figural aftereffect. If the
7.6-cm. circle on the right was judged to be equal to the 7.6-cm.
circle on the left (standard), a score of zero was assigned; if it
was judged smaller (this being evidence of a figural aftereffect), a
score of +1 was assigned. If the 7.8-cm. circle on the right was
judged smaller than the standard, a score of +2 was assigned. Thus
the weights were increased by integral units as the magnitude of
figural aftereffect increased. An identical procedure was used with
the 4.0-cm. standard circle, but since the effect of the inspection
was to make the 4.0-cm. standard appear smaller, evidence of the
visual aftereffect was a tendency of the circle on the right to
appear larger.

A similar procedure was used to score the magnitude of the
aftereffect of the Gibson type.
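The weighting scheme for the 7.6-cm. standard can be illustrated with a minimal sketch. The text specifies only three cases (equal on 7.6 cm. scores 0, smaller on 7.6 cm. scores +1, smaller on 7.8 cm. scores +2); the extension to the larger comparison circles by integral steps follows the text, but the rule that any judgment other than "smaller" scores 0 is an assumption made here for illustration, as are the function names and the sample judgments.

```python
# Comparison circles (cm.) shown with the 7.6-cm. standard, smallest first.
COMPARISONS_CM = [7.6, 7.8, 8.0, 8.2, 8.4]


def trial_score(comparison_cm, judgment):
    """Score one trial; judgment is 'smaller', 'equal', or 'larger'.

    Assumption: only a 'smaller' judgment earns points, and the weight
    rises by one unit for each step up in comparison-circle size.
    """
    if judgment != "smaller":
        return 0
    return COMPARISONS_CM.index(comparison_cm) + 1


def total_score(trials):
    """Sum the trial scores for one S over all test stimuli."""
    return sum(trial_score(size, judgment) for size, judgment in trials)


# Hypothetical judgments for one S (not data from the study).
example_trials = [(7.6, "smaller"), (7.8, "smaller"), (8.0, "equal")]
example_total = total_score(example_trials)
```

Totaling over all test stimuli in this way gives one value per S for each of the three tests, which is the quantity analyzed in the results.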


RESULTS AND DISCUSSION 


With the scoring procedures a value was computed for each S for the
control test, the initial aftereffect test, and the final
aftereffect test. These were computed by totaling the number of
points obtained by S on all the test stimuli. The average values
obtained and their SDs are shown in Table 1. In each experiment the
values obtained for each of the three groups could be compared. The
differences between the magnitude of the aftereffect (immediately
after inspection) and the final magnitude (after the interpolated
visual experience) were analyzed with each S serving as his own
baseline. Negative differences occurred where the final aftereffect
test resulted in a lower score than the immediate postinspection
test, i.e., a decrease of the aftereffect over time. Conversely,
positive differences indicated an increase of aftereffect over time.

TABLE 1

AVERAGE MAGNITUDES AND SDs OF AFTEREFFECT SCORES

[Columns: Köhler-Wallach and Gibson experiments, by postinspection
condition (including Ganzfeld and Blindfold), Mean and SD. Rows:
Control, Initial aftereffect, Final aftereffect. Only fragments of
the cell values survive: 2.8, 6.6, 1.8 under Köhler-Wallach,
Blindfold, and the truncated entries 0., 4., 0.6.]

The results shown in Table 1 suggest that with the Köhler-Wallach
experiment the ganzfeld condition serves to enhance the aftereffect,
and although the blindfold condition does not enhance the
aftereffect it does reduce the amount of decay. The greatest
decrease in aftereffect occurred with the normal vision group. With
the Gibson aftereffect the greatest decay occurred under the
ganzfeld condition. These findings are verified when the data are
subjected to analysis of variance.

With the Köhler-Wallach effect the three groups are significantly
different from each other (P < .01) and the ganzfeld group shows a
significant enhancement effect (P < .05), i.e., an increase in
magnitude of aftereffect over the 30-min. period. A significant
difference (P < .025) between groups found with the Gibson
aftereffect is apparently due to the fact that the ganzfeld group
had somewhat inflated values in the initial aftereffect test. In any
case no significant differences in final level of aftereffect
occurred with the Gibson aftereffect, i.e., the aftereffect had
vanished completely under all three visual conditions.
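The own-baseline comparison described in this section amounts to a per-S difference score, final minus initial. A minimal sketch follows; the function name and the three pairs of scores are hypothetical and are not values from the study.

```python
from statistics import mean


def change_scores(initial, final):
    """Final minus initial aftereffect score for each S.

    Negative values mean the aftereffect decreased over the
    postinspection period; positive values mean it increased.
    """
    return [f - i for i, f in zip(initial, final)]


# Hypothetical initial and final scores for three Ss in one group.
initial_scores = [6, 7, 5]
final_scores = [2, 1, 3]

differences = change_scores(initial_scores, final_scores)
group_mean_change = mean(differences)
```

A group whose mean change is positive would correspond to the enhancement reported for the ganzfeld group, and a negative mean to decay.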

Since conditions of homogeneous stimulation affect these two visual
aftereffects in radically different ways, it is strongly implied
that the two phenomena are basically different.

The results are, in general, opposite to what might be expected if
each theory predicted the outcome for its own type of aftereffect.
The inference from the Köhler-Wallach theory was that a diminution
of the aftereffect would occur following conditions of homogeneous
stimulation. However, the Köhler-Wallach aftereffect was enhanced by
the ganzfeld situation. Gibson's normalization theory would predict
a maintenance of the curved line aftereffect under homogeneous
stimulation but instead this effect had essentially vanished.

The nature of the perceptual system which might account for the
paradoxical enhancement or the maintenance of the Köhler-Wallach
effect warrants further investigation. It may be related to the
higher susceptibility to figural aftereffects found after a period
of sensory deprivation (Doane, Mahatoo, Heron, & Scott, 1959). In
the present study, if the ganzfeld and blindfold conditions were
considered conditions of sensory deprivation it would suggest that
stimulation just prior to such deprivation has a particularly strong
effect. If the homogeneous conditions are related to sensory
deprivation the differences obtained under the two conditions
require further investigation.
SUMMARY


The effects of three conditions of poststimulation on a
Köhler-Wallach figural aftereffect and a Gibson negative aftereffect
were investigated. Condition I was normal stimulation, obtained by
looking around the room; Cond. II was homogeneous lack of
stimulation, obtained by wearing a black blindfold; Cond. III was
homogeneous light stimulation obtained by exposure to a "ganzfeld."
The Gibson aftereffect decreased normally under all three
conditions. The ganzfeld enhanced the Köhler-Wallach aftereffect,
the blindfold retarded the decrease in the aftereffect, and looking
around the room permitted the normal disappearance of the figural
aftereffect.
EFFECTS OF PROBABLE OUTCOME INFORMATION
ON TWO-CHOICE LEARNING 


RICHARD C. NIES¹


University of California, Los Angeles 


The impact of statistical learning 
theory (Bush & Mosteller, 1955; 
Estes, 1950) has focused attention 
upon probability learning in the two- 
choice situation (Goodnow, 1958; 
Humphreys, 1939). Although success 
is maximized by consistent choice of 
the more likely alternative, Ss typ- 
ically approximate the actual propor- 
tions of the reinforcement schedule 
(Hake, 1955; Jenkins & Stanley, 
1951). However, when the task is 
presented in a gambling context (i.e., 
total correct choices are maximized), 
Ss tend to predict the more frequent 
event at a significantly higher propor- 
tion of the trials (Goodnow, 1955; 
Siegel & Goldstein, 1959). The pres- 
ent study was designed to further 
explore the conditions which maximize 
success by manipulating information 
about the probability of the events. 
Although responses in two-choice
situations appear to be relatively 
independent of experimental instruc- 
tions (Anderson & Grant, 1957; 
Neimark & Shuford, 1959) and per- 
formance information (Das, 1961), 
Koehler (1961) was able to predict 
mean terminal response rates by vary- 
ing instructions which dealt with how 
Ss should consider the nonreinforced 
trials in a two-choice contingent
partial reinforcement situation. In view of this finding, the
stability of behavior to the manipulative effects of differential
information in such situations is to be questioned. It was therefore
hypothesized that specific information about the probabilities of
two alternative outcomes would produce a shift toward consistent
selection of the more likely event.

¹ The author expresses his appreciation to Allen Parducci and
Norman H. Anderson for their invaluable guidance in the design and
analysis of this experiment.
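The contrast between probability matching and consistent choice of the more likely alternative can be made concrete with a short calculation. The sketch below assumes independent trials and a choice made independently of the upcoming outcome; the function name is illustrative, and the .70 value is the event probability used in this study.

```python
def expected_correct(p_event, p_choice):
    """Expected proportion of correct predictions.

    p_event: probability that the more likely event occurs on a trial.
    p_choice: probability that S predicts the more likely event.
    A prediction is correct when choice and outcome agree.
    """
    return p_event * p_choice + (1 - p_event) * (1 - p_choice)


# Probability matching: S predicts the likely event on 70% of trials.
matching = expected_correct(0.70, 0.70)

# Maximizing: S predicts the likely event on every trial.
maximizing = expected_correct(0.70, 1.00)
```

Under these assumptions matching yields about .58 correct while consistent choice yields .70, which is why a shift toward consistent selection counts as a move toward optimal betting.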


METHOD 


Apparatus.—A box containing 100 marbles,
70 of one color and 30 of another color, was 
mounted so that the marbles could be mixed 
by turning a crank protruding from one end. 
A trough at the other end received 1 marble 
whenever the box was tilted forward, the 
marble rolling back into the box when the 
apparatus was returned to its normal position. 

Subjects.—Eight experimental and four
control sessions were held, with groups of 16 
Ss drawn for each session from the course in 
introductory psychology at the University of 
California, Los Angeles. 

Procedure.—The following instructions
were read aloud to all Ss in the experimental 
groups: 


This experiment is designed to study 
human guessing habits. In this box are a 
number of blue and red marbles (shown 
briefly). For each trial, I will shake the box 
thoroughly and allow one marble to roll into 
the trough (demonstrated). While I am 
shaking the box I will count to three 
(demonstrated), and by the time I say 
"three," you write down on the response
sheet, which will be given you shortly, 
whether you think a blue marble or a red 
marble will roll out. After you have written 
your choice, I will allow a marble to roll out 
and you will then be told its color. Your 
task is to get as many correct predictions 
as you can. 


To assure the 70-30 ratio of blue to red 
marbles, a secret sheet was prepared which 
maintained this schedule randomly in blocks 
of 50 (35 blue marbles to 15 red marbles) 
over 250 trials. The random sequences were 
chosen with no constraints from a table of random numbers according
to their probabilities. Although an assistant read the color of the
marble for each trial from this prepared schedule, Ss were led to
believe that he was reporting the color of the marble that actually
rolled out. This procedure insured that the different groups were
exposed to identical sequences.
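A schedule that holds the 70-30 ratio within each block of 50 trials could be generated along the following lines. This is only a sketch: the study drew its sequences from a table of random numbers, whereas the code shuffles a fixed set of 35 blue and 15 red outcomes within each block with Python's pseudorandom generator. The function name, the seed, and the "B"/"R" coding are assumptions for illustration.

```python
import random


def make_schedule(n_blocks=5, n_blue=35, n_red=15, seed=0):
    """Build an outcome schedule with a fixed ratio in every block.

    Each block holds n_blue 'B' and n_red 'R' outcomes in shuffled
    order, so 5 blocks of 50 give the 250 trials of the study.
    """
    rng = random.Random(seed)  # seeded so that all groups share one sequence
    schedule = []
    for _ in range(n_blocks):
        block = ["B"] * n_blue + ["R"] * n_red
        rng.shuffle(block)
        schedule.extend(block)
    return schedule


schedule = make_schedule()
blue_per_block = [schedule[i:i + 50].count("B") for i in range(0, 250, 50)]
```

Reading outcomes from one prepared list, rather than from the box itself, is what allows every group to meet an identical sequence.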

The Ss in each experimental session were randomly divided into four
equal subgroups, differentiated with respect to the additional
information printed at the top of their response sheets. The
No-Information group had no additional information. The Pattern
group had the following sentence added: "It has been found in
experiments of this kind that the marbles roll out in definite
patterns." The Ratio group had the statement: "There are 100 marbles
in the box: 70 are blue and 30 are red." And the Ratio-Explanation
group was informed that: "There are 100 marbles in the box: 70 are
blue and 30 are red. This means that the chances of a blue marble
rolling out are 7 out of 10, and the chances of a red marble rolling
out are 3 out of 10. Since each marble that rolls out is returned to
the box, these odds will be the same for each trial. Furthermore,
since the box is shaken thoroughly each time, there can be no fixed
pattern in which the marbles roll out."


For all conditions, S was to indicate his prediction by recording
either an "R" or a "B" on each trial.

To control for color preferences and sequence peculiarities, half
the experimental Ss were run with colors reversed (i.e., 70 reds to
30 blues), and half were run using a different 70-30 sequence. To
control for unique group differences, each of these four
experimental combinations was replicated with a second group. The
result was a 2 × 2 × 4 factorial design (Color × Sequence ×
Information) replicated with a second set of four experimental
groups.
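The cells of the 2 × 2 × 4 design can be enumerated as a quick check on the group sizes. In this sketch the level labels for Color and Sequence are placeholders; the four Information levels and the counts of 8 sessions and 16 Ss per session come from the Method section.

```python
from itertools import product

# Level labels for Color and Sequence are illustrative placeholders.
colors = ["70 blue", "70 red"]
sequences = ["sequence A", "sequence B"]
information = ["No-Information", "Pattern", "Ratio", "Ratio-Explanation"]

# Every Color x Sequence x Information combination is one design cell.
cells = list(product(colors, sequences, information))

n_sessions = 8        # experimental sessions reported in the Method
ss_per_session = 16   # Ss drawn for each session
total_ss = n_sessions * ss_per_session
ss_per_cell = total_ss // len(cells)
```

With 16 cells and 128 experimental Ss, each cell holds 8 Ss, that is, 4 Ss from each of the two replicate groups.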
Four control groups (two groups for each of the reinforcement
sequences under the .70-red condition only) were also run to provide
a baseline against which the effects of the marble box could be
assessed. They were read the following instructions:

This experiment is designed to study human guessing habits. For each
trial, I will call out one of two colors, blue or red. Before I
announce the color, I will count to three (demonstrated). While I am
counting, you write down on the response sheet, which will be given
you shortly, whether you think I will call out blue or red. After
you have written your choice, I will then announce the color. Your
task is to get as many correct predictions as you can.
FIG. 1. Mean proportional choices for the 70 marble by trial blocks
under various levels of information. (Abscissa: blocks of trials;
ordinate: mean proportional choices for 70 bead; curves:
Ratio-Explanation group, Ratio group, Pattern group, No Information,
Control group.)


RESULTS AND DISCUSSION 


Figure 1 shows the mean proportion of choices for the more likely
alternative over each successive block of 50 trials. Separate
analyses of variance were performed for each block of 50 trials,
using the mean proportions as raw scores. Table 1 summarizes the
analysis for the first 50 trials, showing the significant effect of
Information and also of the interaction between Information and
Sequence. This interaction was found only in the first block of 50
trials, so it was not judged as critical in modifying the
interpretation of the main effects. In the other blocks, the only
significant effect that appeared was that for Information in Trial
Blocks 51-100 and 101-150 (F = 3.64 and 3.61, respectively;
df = 3/96; P < .05). The different levels of information thus
produced significant differences in betting behavior over the first
three trial blocks.

TABLE 1

ANALYSIS OF VARIANCE FOR TRIAL BLOCKS 1-50 FOR EXPERIMENTAL GROUPS

Source                           df     MS        F
Instructions (I)                        639.80    14.55**
Color (C)
Sequence (S)
I × C
I × S
C × S
I × C × S
Groups within treatments (a)
Pooled Groups × Instructions     12
Ss within groups (b)             96

(a) Error term for C, S, and C × S.
(b) Error term for I, I × C, I × S, I × C × S, Groups within
treatments, and Pooled Groups × I.
* Significant at .05 level.
** Significant at .001 level.

Since the experimental conditions 
revealed no significant effects for 
Trials 201-250, the scores for this 
block were pooled (using the .70-red 
condition only) and tested against the 
control scores for the same block.
The difference was highly significant 
(F = 16.33; df = 1/100; P < .001). 

These analyses support the following conclusions: (a) The
information about probable event outcomes affects Ss' responses
during the early blocks of trials (up to 150 trials), but shows no
significant effect in later blocks. These early differences appear
to be relatively independent of color preferences, sequence
peculiarities, and group differences. (b) While the control groups
(no marbles) just reach the probability matching level on the last
block of trials, the experimental groups (marbles) exceed this level
for all the trial blocks. Relevant to this finding is the proposal
by Flood (1954) and Rubinstein (1959) that the awareness of
randomness in a two-choice probabilistic outcome situation tends to
elevate S's response predictions, the impossibility of a complete
solution increasing S's caution against betting on the less likely
alternative. Randomness was made explicit in the present study by
the use of a box in which marbles were thoroughly mixed in Ss'
presence before each prediction was made. In the control groups with
no marbles, Ss displayed typical probability matching behavior.


While consistent with the interpretation made by Flood and
Rubinstein, the present experiment advances their argument by
demonstrating an early facilitation of optimal betting through the
introduction of specific information relevant to the probable
outcome of the events. While the presence of the marble box can
account for the higher response level of the experimental groups,
presumably through its contribution to the random appearance of the
events, the perception of randomness was hastened by the critical
information supplied to Ratio and Ratio-Explanation groups. When
information is effective in determining behavior in a two-choice,
uncertain outcome situation, its effectiveness may be based on its
contribution to the perception of randomness. The fact that the
instructions used by Anderson and Grant (1957) and Neimark and
Shuford (1959) did not elevate Ss' response levels may result from a
failure to make explicit the randomness and impossibility of a
complete solution.

Although the experimental conditions elevated the proportion of
correct anticipations, only 4 Ss (all from the Ratio-Explanation
group) learned to bet 100% on the more likely alternative. Two
suggestions for this relative lack of optimal betting find support
in the present research: (a) The Ss seem surprisingly expectant of
patterns and systems when there is any challenge of a problem to be
solved (Goodnow, 1958). This is consistent with the responses to a
questionnaire given at the conclusion of the present experiment
where 75 of the 128 experimental Ss reported looking for a pattern
whereas only 32 had been told (falsely) that there was one. Also, as
shown in Fig. 1, the betting curve for the Pattern group is fairly
close to that of the No-Information group. Moreover, while patterns
may be expected from the problem solving nature of the task, certain
characteristics of the data support it. In contrast to the method of
constrained randomization used by Edwards (1961; Lindman & Edwards,
1961) and Nicks (1959), the method used in this research involved an
unrestrained randomization, resulting in a distribution of lengths
of homogeneous outcome runs which included too many short runs and
not enough long runs. Thus there were patterns in the outcome
sequence which allowed for the gambler's fallacy. The point to be
made here is that this reduction from optimal betting is not
"irrational" in terms of the Ss' set to expect patterns.
(b) Furthermore, the Ss prefer the occasional success of guessing
the unlikely alternative to the monotony of repetitiously predicting
the more certain event where no tangible inducements are offered to
maximize their correct predictions (Goodnow, 1955; Siegel &
Goldstein, 1959). Thus, there is a second source of reinforcement at
work, the utility of correctly predicting the occurrence of the less
frequent event, which subtracts from maximum gain responding
(Brackbill, Kappy, & Starr, 1962). This also is consistent with the
questionnaire responses; 77 of the experimental Ss reported that
they knew the wisest procedure would be to bet consistently on the
more likely alternative.

The data, thus, suggest a compromise between the expedience of
betting 100% for the more likely alternative (because of the
"chance" or "random" structure made patent by the experimental
apparatus) and the challenge of betting 70% for the more likely
alternative (i.e., trying out different patterns in an effort to
"beat" the game).
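The distribution of lengths of homogeneous outcome runs mentioned in the discussion can be tabulated for any outcome sequence with a few lines of code. The sketch below is illustrative only: the function name and the short "B"/"R" sequence are hypothetical and are not the schedule used in the experiment.

```python
from collections import Counter
from itertools import groupby


def run_lengths(sequence):
    """Return the length of each run of identical consecutive outcomes."""
    return [len(list(group)) for _, group in groupby(sequence)]


# Hypothetical outcome sequence: B = blue marble, R = red marble.
outcomes = "BBRBBBRR"

lengths = run_lengths(outcomes)
length_distribution = Counter(lengths)  # run length -> number of runs
```

Applied to a full 250-trial schedule, such a tabulation shows how many short and long runs a given randomization method produced.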


SUMMARY 


The experiment investigated the effect of different levels of
probability information on response frequencies in a random,
two-choice situation with unequal event probabilities. This
information was demonstrated to have differential effects during the
early trials. In addition, it was found that Ss reached a
significantly higher response level when the outcome of a trial
appeared to depend upon the chance drawing of a marble from a box
than when the marble box was absent. These findings were interpreted
in terms of the perception of "randomness."
There is relatively little available data on the effectiveness of
color as a coding dimension for information transmission in visual
displays. Some studies have been published exploring the limitations
of our ability to identify different hues on the basis of absolute
discrimination. They suggest that under good viewing conditions 5 to
8 colors can be distinguished reliably (Conover, 1959; Conover &
Kraft, 1958; Eriksen & Hake, 1955), or perhaps even as many as 9 to
12 under optimal conditions (Chapanis & Halsey, 1956; Halsey &
Chapanis, 1951).

Whichever number we accept, this means that color as a coding
dimension can be used to distinguish only a relatively small number
of categories, as compared with shape coding (symbology) which
carries the bulk of information in visual displays. However, within
this limitation, there is evidence that color is better than shape
in tasks which involve locating displayed data (Christner & Ray,
1961; Hitt, 1961).

The research reported in this article was supported by the
Department of the Air Force under Contract AF-33(600)39852. A more
detailed account of this research was published as a MITRE Technical
Series Report, MTS-7, "Display Color Coding for a Visual Search
Task," June 1962.

The effectiveness of color coding for locating particular displayed
items is related, by extension, to its use for providing visual
separability among data classes. Results on this question have been
reported by Green and Anderson (1956). They examined the degree to
which color coding permitted visual separability of displayed
two-digit numbers, as measured by decreases in average visual search
time when the color of a "target" number was known beforehand
(relevant) as compared with other occasions when it was not known
(nonrelevant). They (Green & Anderson, 1956) summarized their
conclusions as follows:



When Os know the color of the target, the
search time is approximately proportional to
the number of symbols of the target’s color.
There is also a slight increment in search time
due to the presence of the wrong-colored
targets. When Os do not know the target's
color, search time depends primarily on the
total number of symbols on the display.
However, search times are slightly longer for
multicolored displays than for comparable
single-colored displays (p. 24).



In an attempt to confirm these re- 
sults, this present experimental study 
was conducted. This study accepts 
the basic premise of Green and Ander- 
son that visual search time is a funda- 
mental measure of the potential value 
of display color coding. It simply 
expanded their model to include a 
greater range of displayed densities, 
more displayed colors, both light and 
dark display backgrounds, and certain 
modified techniques of display pres- 
entation. 

PROCEDURE 


Twelve Ss participated in this study, 11
men and 1 woman. Preliminary testing
using the American Optical Company H-R-R
pseudoisochromatic plates confirmed that all
Ss had normal color vision. In the course of
the study, each S made a total of 300 visual
searches, using a variety of different displays.
This required several experimental sessions
for each S. Individual Ss worked for no
longer than 1 hr. at a time, and for no more
than 2 hr. per day.



The displays consisted of varying arrays of three-digit numbers.
These numbers were randomly placed in a square field, which can
be imagined as comprising 13 columns and 27 rows, for a total of
351 possible positions. The numbers themselves were chosen
randomly from the 1000 possibilities (000 through 999) with
certain restrictions: the numbers on each particular display were
all different; they were unique in terms of their first two
digits; their third digits represented an equal sample of each of
the 10 possibilities (0 through 9). In the case of multicolored
displays, the particular color of each three-digit number was
also chosen at random, with the restriction that all colors were
equally represented.

The displays were made up as 2 × 2 in. color slides, and
presented to Ss by rear projection on the screen of an
experimental console. The dimensions of the digits as projected
on this screen were } in. high × 4 in. wide. Viewing distance was
about 18 in. The overall display field as projected was 12 in.
square.

The colors used on the various displays were red, green, blue,
orange, and either white (on slides with a black background) or
black (on a white background). Visual matches of the colors as
projected with standard colors resulted in agreement by 2 Os on
the descriptive specifications shown in Table 1.

Because rear projection was used, a moderately high ambient
illumination was maintained, by diffuse overhead lighting from
dimmed fluorescents: over .5 ft-c as measured at the experimental
console.

The experimental routine for S began when E signaled to him the
first two digits of the target number, the number he was to find
on the display. These two digits were displayed on an auxiliary
panel. The S also saw on this panel either a colored indicator
showing what color the target number would be, or else a
statement “color unknown.” The E then exposed the slide on the
screen in front of S and started a clock. The S searched for the
target number, indicated when he had found it by pushing one of
10 numbered buttons, corresponding to the third digit of the
target number. If this response was correct, the clock stopped, a
chime sounded, and the screen went blank. The E recorded the
time, and then set up the auxiliary display panel to begin the
next trial. In the very rare case when S pushed the wrong
response button, a loud buzzer sounded, and the trial was run
again later in the experimental series.


TABLE 1

DISPLAY COLORS USED (MUNSELL NOTATION)

Columns: Display Color; White Background; Black Background.
Rows: Red; Green; Blue; Orange; Black/White.
[Munsell values not legible in source; one surviving fragment
reads "R 7/10".]


TABLE 2

DISPLAYS USED OF EACH TYPE

Number of            Number of Displayed Items
Colors Displayed     20     40     60     80     100

       1              5      5      5      5      5
       2              -     10      -      -      -
       3              -      -     10      -      -
       4              -      -      -      5      -
       5              5      5      5      5      5


The displays were presented in a series of 
150 slides, half with white backgrounds and 
an identical group with black backgrounds. 
Each group of 75 slides contained both single- 
and multicolored displays with display 
densities varying from 20 to 100 three-digit 
numbers (which will be called “items” in the
next few paragraphs to avoid confusion). 
The types of displays used are summarized in 
Table 2. The entries in the matrix represent 
the number of different displays used. 

There are really three sets of displays 
represented in the matrix. The upper row 
consists of single-colored displays of increasing 
display density, with 5 displays (one of each 
color used) at each density level. The lower 
row consists of a comparison set of displays, 
each with items of all five colors on it. It was 
arbitrarily decided to use 5 displays at each 
density level, to match the first set. The 
diagonal set of the matrix represents those 
displays where an increasing number of dis- 
played colors is associated with increasing 
display density, with the advantage that a 
constant number (20) of items of any par- 
ticular color are displayed amid varying 
numbers of other-colored items: 10 two- 
colored displays were used, representing all 
the possible pairs of the five colors available;
10 three-colored displays represented all the 
possible triple combinations; and 5 four- 
colored displays represented the cases where 
each of the five available colors had been 
omitted. 
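
The three sets of displays described above can be enumerated in a few lines, which gives a quick check that they add up to the 75 slides used on each background. This is only an illustrative sketch: the function name `display_types` and the color labels are assumptions introduced here, not part of the original procedure.

```python
from itertools import combinations

# Hypothetical labels; the fifth color is white or black depending on background.
COLORS = ["red", "green", "blue", "orange", "black/white"]
ITEMS_PER_COLOR = 20
DENSITIES = (20, 40, 60, 80, 100)


def display_types():
    """Return a list of (colors, density) pairs for one background."""
    displays = []
    # Upper row: single-colored displays, one of each color at each density.
    for density in DENSITIES:
        for color in COLORS:
            displays.append(((color,), density))
    # Lower row: five-colored displays, 5 at each density.
    for density in DENSITIES:
        for _ in range(5):
            displays.append((tuple(COLORS), density))
    # Diagonal: every 2-, 3-, and 4-color subset with 20 items per color.
    # The two ends of the diagonal (1 color/20 items, 5 colors/100 items)
    # are already counted in the upper and lower rows.
    for k in (2, 3, 4):
        for subset in combinations(COLORS, k):
            displays.append((subset, ITEMS_PER_COLOR * k))
    return displays


if __name__ == "__main__":
    all_displays = display_types()
    print(len(all_displays), "displays per background")
```

Counting subsets with `combinations` reproduces the 10 two-colored, 10 three-colored, and 5 four-colored displays named in the text.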

The series of 150 displays described above were presented in a
random order, which in turn was randomized (in five blocks)
differently for different Ss. Each S went through this series
twice in the course of the experiment, for a total of 300
searches. The first time through, for half of the displays
(randomly chosen) he was told in advance the color of the target
number, the number-to-be-searched-for. For the other half of the
displays, the color of the target number was unknown to him.
During his second run through the series, these conditions were
reversed for each particular display: where the color of the
target had been known before, it was now unknown, and vice versa.
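
The counterbalancing of prior knowledge across the two runs can be sketched as follows. The function name `assign_knowledge`, the integer display IDs, and the seed argument are hypothetical; the original randomization procedure is not described in more detail than the paragraph above.

```python
import random


def assign_knowledge(display_ids, seed=0):
    """Pick a random half of the displays as 'color known' for run 1,
    then reverse the condition for every display in run 2."""
    ids = list(display_ids)
    rng = random.Random(seed)
    known_first = set(rng.sample(ids, len(ids) // 2))
    run1 = {d: (d in known_first) for d in ids}
    run2 = {d: not known for d, known in run1.items()}
    return run1, run2


if __name__ == "__main__":
    run1, run2 = assign_knowledge(range(150))
    print(sum(run1.values()), "displays with color known in run 1")
    print(sum(run2.values()), "displays with color known in run 2")
```

With this scheme every display is searched once under each knowledge condition, which is what allows the known and unknown conditions to be compared on identical displays.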

In summary, visual search time data were 
obtained from 12 Ss, each viewing a series of 
300 displays, which varied in display density, 
in number of colors used, in the particular 
color of the target, with either a white or 
black background, under conditions where 
S either knew the color of the target number 
in advance or did not. 


RESULTS AND DISCUSSION 


Because of the relative complexity of the experimental design, no
single statistical treatment would suffice to answer all the
questions we might wish to raise. In fact, five separate analyses
of variance were made, each using some portion of the data, in
order to examine the effect of different combinations of the
experimental variables.

The most extensive analysis of variance, and one basic to the
subsequent data analysis, was carried out using individual search
times obtained under all conditions involving either
single-colored or five-colored displays. These data represent a
factorial design of 12 Ss by 2 conditions of prior knowledge
(target color either known or unknown) by 2 types of display
background by 5 degrees of display density by 2 types of display
(either single-colored or five-colored) by 5 possible colors used
for the target number. Taken together they comprise 2400 measures.
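
As a check on the arithmetic, the size of this factorial design is simply the product of the number of levels of each variable. The dictionary keys below are labels chosen here for illustration.

```python
from math import prod

# Levels of each variable in the main analysis, as listed in the text.
DESIGN = {
    "Ss": 12,
    "prior knowledge": 2,
    "display background": 2,
    "display density": 5,
    "display type": 2,
    "target color": 5,
}

total_measures = prod(DESIGN.values())
measures_per_subject = total_measures // DESIGN["Ss"]

if __name__ == "__main__":
    print("total measures:", total_measures)
    print("measures per S:", measures_per_subject)
```

The per-subject figure covers only the single-colored and five-colored displays; the remaining searches of each S's 300 involved the two-, three-, and four-colored displays.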


The data initially available for this analysis were search times
expressed in .01 sec. Because of the inherently high correlation
between mean and variance in search time data, a log transform
was used prior to the variance analysis computations. The
computations themselves were carried out by an electronic
computer, and sums of squares were obtained for the six
experimental variables and their 57 various interaction terms,
including the six-way residual. Following a procedure suggested
by Edwards (1950) for the treatment of repeated measures obtained
from the same Ss, all sums of squares representing interactions
of Ss with other experimental variables were pooled to form one
residual representing the Ss × Conditions interaction. The mean
square of this interaction was used as the error term to test the
significance of all others. (As it happens, if Ss had been
treated as a legitimate variable in the analysis, none of the
interactions involving Ss would have proved statistically
significant.) Because of the large number of F ratio comparisons
made in this analysis, only significance levels of at least .001
were accepted as persuasive evidence of statistical reliability.
The summarized results of this analysis are presented in Table 3.


TABLE 3

ANALYSIS OF VARIANCE OF TRANSFORMED SEARCH TIMES FOR ALL
EXPERIMENTAL CONDITIONS INVOLVING ONE- AND FIVE-COLORED DISPLAYS

Source of Variance

Ss
Conditions
  Knowledge of target color (K)                   21.787
  Display background (B)                           0.301
  Display density (D)                             16.791
  Number of colors (N)                            14.956
  Color of target (C)                              0.063
  K × B                                            0.015
  K × D                                            0.619
  K × N                                           15.549
  K × C                                            0.106
  B × D                                            0.071
  B × N                                            0.000
  B × C                                            0.095
  D × N                                            0.129
  D × C                                            0.129
  N × C                                            0.292
  Higher-order interactions among conditions (a)   0.096
Ss × Conditions                                    0.095

(a) None was significant when [remainder not legible in source].
* P < .001
[The remaining columns of the table and the significance markers
are not legible in source.]


We may note, first, that neither the particular target color
used, nor the display background, nor any interaction term
including these variables, had any statistically significant
effect. We might have expected, for example, that some colors on
a light background would be less legible than on a dark
background, and hence more difficult to scan quickly, since it is
clear that visual contrast is an important variable in
legibility. However, the present data suggest that we need not
expect any measurable effect related to different contrast ratios
that are all well above some sort of legibility threshold.
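
The count of six experimental variables and 57 interaction terms reported for the main analysis can be verified by enumerating every non-empty subset of the six variables. The one-letter labels below follow the letters that appear to be used in Table 3 and are otherwise an assumption; the helper name `effect_terms` is hypothetical.

```python
from itertools import combinations

# Ss plus the five condition variables (labels assumed from Table 3).
VARIABLES = ["Ss", "K", "B", "D", "N", "C"]


def effect_terms(variables):
    """Every main effect and interaction: all non-empty subsets."""
    return [
        combo
        for size in range(1, len(variables) + 1)
        for combo in combinations(variables, size)
    ]


terms = effect_terms(VARIABLES)
main_effects = [t for t in terms if len(t) == 1]
interactions = [t for t in terms if len(t) > 1]
# Interactions involving Ss are the ones pooled into the error term.
pooled_error = [t for t in interactions if "Ss" in t]
# Interactions among the condition variables only.
condition_interactions = [t for t in interactions if "Ss" not in t]

if __name__ == "__main__":
    print(len(main_effects), "main effects")
    print(len(interactions), "interaction terms")
    print(len(pooled_error), "terms pooled into Ss x Conditions")
    print(len(condition_interactions), "interactions among conditions")
```

Ten of the interactions among conditions are two-way terms, which matches the ten two-letter rows listed in Table 3; the rest fall under the higher-order row.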

Next, consider the marked effect on search time of increasing
display density, confirmed by the statistical analysis and
illustrated in varying degrees by Fig. 1-3. It is the nature of
these data that there is greater inherent variability of the
search time measure in those situations where the average search
takes longer. Taking this into account, it seems that the average
search time data closely approximate a direct linear relation
with display density. This was also the case in the Green and
Anderson study and has been noted before by other investigators
(e.g., Green, McGill, & Jenkins, 1953).

If the “curves” in Fig. 1 are extrapolated backward to a display
density of zero, they would intersect the ordinate axis at a
value of about 1 sec., which presumably represents the simple
button-pushing reaction time. This is confirmed by the
observation that only one or two of the individual search time
measures obtained in this study were less than 1 sec., and those
were smaller by only a slight margin. The same observation was
made by Green and Anderson (1956) under similar circumstances. It
should be pointed out that their reported search times were in
general somewhat shorter than those in the present study. In
part, this is because they reported geometric rather than
arithmetic means. And, in part, this difference is probably
attributable to the fact that their display field (12 X 163 in.,
viewed from a distance of 10 ft.) subtended a smaller visual
angle than was the case in this present study. Hence, fewer eye
movements were required to scan their displays.

The significant interaction between knowledge of target color and
number of colors displayed indicates the potential value of color
coding when the color is relevant to the search task. This is
illustrated by the sizable difference between the curves in
Fig. 1. If the color coding permitted absolute visual
separability between classes of displayed items, then the average
search times for these five-color displays would be only
one-fifth as great when the target color was known as when it was
unknown. In actuality, the difference was not quite so great as
this, but considerable nonetheless.

FIG. 1. Search time as a function of display density with
knowledge of target color as a parameter. [Five-color displays;
axes: average search time vs. display density (number of
three-digit items).]
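
The linear relation and the "one-fifth" expectation discussed above can be put into a small idealized model. This is a sketch under stated assumptions, not the author's analysis: the function name, the `sec_per_item` slope (a made-up value in the usage example), and the decision to apply the one-fifth reduction only to the scanning component above the roughly 1-sec. intercept are all assumptions introduced here.

```python
def predicted_search_time(n_items, sec_per_item, base_sec=1.0,
                          n_colors=1, color_known=False):
    """Idealized linear model of average search time.

    base_sec     -- intercept, the button-pushing reaction time (about 1 sec.)
    sec_per_item -- hypothetical slope: added time per item scanned
    If the target color is known and color gave perfect separability,
    only the items of the target color (n_items / n_colors, with colors
    equally represented) would have to be scanned.
    """
    items_to_scan = n_items / n_colors if color_known else n_items
    return base_sec + sec_per_item * items_to_scan


if __name__ == "__main__":
    slope = 0.1  # hypothetical value, for illustration only
    for density in (20, 40, 60, 80, 100):
        unknown = predicted_search_time(density, slope, n_colors=5)
        known = predicted_search_time(density, slope, n_colors=5,
                                      color_known=True)
        print(density, round(unknown, 2), round(known, 2))
```

Under this model the known-color curve has one-fifth the slope of the unknown-color curve; the reported data showed a smaller, though still considerable, advantage.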

Comparable curves for  single- 
colored displays showed no such dif- 
ference, i.e., knowing the target color 
beforehand did not speed the sub- 
sequent visual search process. In- 
deed, there is no logical reason to 
expect that it would. However, to 
check this, a separate variance anal- 
ysis was conducted, similar to that 
already described, but using only the 
search times for single-colored dis- 
plays. The only significant variable 
in this treatment turned out to be 
display density. Prior knowledge of 
the target color made no difference. 

A comparable variance analysis was 
run comparing one- and five-colored 
displays, but using only the search 
time data obtained when Ss had no 
prior knowledge of target color. 
Again, the only significant variable 
proved to be display density. In 
particular, number of colors displayed 
made no difference to Ss when they 
had no knowledge of the target color 
to guide them. This lack of difference 
is illustrated by the overlapping 
curves in Fig. 2. 

This last finding would seem to bear 
directly on the conclusion by Green 
and Anderson that multicolored dis- 
plays retard visual search when target 
color is unknown, and in fact, is 
evidence that no such effect occurred 
in this present study. It should be 
noted, however, that their conclusion 
was based on somewhat different 
evidence. Data obtained under equi- 
valent conditions in this present study 
are summarized in Fig. 3. The dis- 
plays considered here are those which 
always contained 20 items of the same 
color as the target number. For the displays with just one color,
this is all they contained. For the two-colored displays, there
were also 20 items of the second color present. For the
three-colored displays, there were 20 items displayed for each of
the two other (nontarget) colors, and so on, for the four- and
five-colored displays.

FIG. 2. Search time as a function of display density under
conditions of nonrelevant color coding and of no color coding
(single-colored displays). [Curves: five-color displays
(target color unknown) and single-color displays (combined data);
axes: average search time (seconds) vs. display density (number
of three-digit items).]

FIG. 3. Search time as a function of display density when target
color is known and unknown and when there are 20 items of target
color in all cases. [Upper axis: number of display colors; axes:
average search time (seconds) vs. display density (number of
three-digit items).]

This comparison in Fig. 3, showing the considerably reduced
search times when the target color was known to Ss, is
essentially the comparison made by Green and Anderson in their
report. It is true that the number of displayed colors is here
confounded with differences in display density. But in the sense
that each display contains just 20 target numbers amid varying
amounts of “clutter,” the comparison it permits is interesting.



To permit a direct check of the Green and Anderson (1956)
results, a further analysis of variance was run, comparing search
times for single-colored displays, of 40, 60, 80, and 100 items,
with those for two-, three-, four-, and five-colored displays, of
corresponding respective densities, under conditions of no prior
knowledge of target color. Since neither display background nor
target color had proved to be significant variables in preceding
analyses, these were eliminated in this analysis by summing
individual search times across these variables. Thus the data
used consisted of the sum of 10 search times (5 target colors by
2 display backgrounds) under each combination of display density
and number of colors. Actually, this is not entirely true, for in
the case of the two-colored, 40-item displays, and the
three-colored, 60-item displays, there were 20 search times
available (10 target color/other color combinations by 2 display
backgrounds). In these cases, half the available data were chosen
randomly to use in the variance analysis. This total data subset,
then, represents a factorial design of 12 Ss by 4 degrees of
display density by 2 types of displays (single-colored versus
multicolored), for a total of 96 measures.

The results of this analysis are summarized in Table 4. The only
variable with a statistically significant effect proves again to
be display density. There is no reliable difference that can be
attributed to number of displayed colors. If such a difference
had been confirmed, it would have been in favor of the
multicolored displays, which were searched in an average time of
5.9 sec., as compared with 6.5 sec. for the single-colored
displays. Certainly we cannot conclude that multicolored displays
were distracting in this present study, even though the target
color was unknown.


TABLE 4

ANALYSIS OF VARIANCE OF SUMMED SEARCH TIMES FOR CONDITIONS
INVOLVING SINGLE- AND MULTICOLORED DISPLAYS WITH NO PRIOR
KNOWLEDGE OF TARGET COLOR

Source of Variance

Ss
Conditions
  Number of colors (N)
  Display density (D)
  N × D
Ss × Conditions

[Numerical entries not legible in source; surviving fragments
read "0.90" and "001".]


Why did Green and Anderson obtain a different result? The most
probable explanation lies in their choice of projection
technique: luminous colored numbers displayed in a dark
environment would provide a stimulus situation very conducive to
perception of a depth illusion based on differences in either
brightness or wave length. If their displayed numbers of
different colors appeared to be in slightly different frontal
planes, this might have led their Ss to scan a multicolored
display several times, a process that on the average would be
somewhat slower than one systematic search of a single-colored
display. In this present study, with rear projection and higher
ambient illumination, the displayed symbols clearly lay on a
single surface, and no depth effects were apparent.

The slight but regular increase in search time, when the target
color was known, as more and more numbers of other colors were
added to the display (the lower curve in Fig. 3), also proved to
be statistically reliable. The data involved in this
demonstration consist of the search times for each individual S,
summed over target color and display background as before, for
displays of one, two, three, four, and five colors, with display
densities of 20, 40, 60, 80, and 100 items, respectively, under
conditions where the target color was known in advance. As in the
previous analysis, only half of the available data for the
two-colored, 40-item displays and the three-colored, 60-item
displays was used. This represents a factorial combination of 12
Ss by 5 display densities (each representing a different number
of displayed colors), for a total of 60 measures. Analysis of
variance confirmed a reliable difference among the 5 display
types at P < .01 (F = 4.37; df = 4/44). This confirms the
conclusion of Green and Anderson, based on a similar analysis,
that wrong-colored items can be almost completely ignored in
visual search: almost, but not quite.
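
The degrees of freedom quoted for this last test (df = 4/44) follow directly from the 12 Ss by 5 display types layout, with the Ss × Conditions interaction serving as the error term. A minimal sketch, with a hypothetical function name:

```python
def repeated_measures_df(n_subjects, n_levels):
    """Degrees of freedom for a one-factor repeated-measures layout
    in which the Ss x Conditions interaction is the error term."""
    df_effect = n_levels - 1
    df_subjects = n_subjects - 1
    df_error = df_effect * df_subjects
    return df_effect, df_subjects, df_error


if __name__ == "__main__":
    df_effect, df_subjects, df_error = repeated_measures_df(12, 5)
    print("effect df:", df_effect)
    print("Ss df:", df_subjects)
    print("error df:", df_error)
    # The three components account for all 60 - 1 degrees of freedom.
    print("total df:", df_effect + df_subjects + df_error)
```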


SUMMARY 


Twelve Ss each viewed a series of 300 
displays, which varied in display density, in 
number of colors used, in the particular color 
of the target, with either a white or black 
background, under conditions where S either 
knew the color of the target in advance, or 
did not. 

Neither the particular color of the target nor the display
background had any significant effect on search time. Search time
increased regularly with increasing display density. For
multicolored displays, when the color of the target was known in
advance, search times were considerably shorter than when the
target color was unknown. When the color of the target was
unknown, search times were not significantly different than those
for single-colored displays.

SIDNEY L. SMITH
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The discrimination hypothesis 
(Humphreys, 1940; Mowrer & Jones, 
1945) ascribes the partial reinforce- 
ment effect (PRE) to the relative ease 
of discriminating an abrupt transition 
to extinction following regular rein- 
forcement as compared to the more 
gradual transition following partial 
reinforcement. 

If the recent conditions of reinforcement provide the basis of
the discrimination, then the PRE should be decreased by the
interpolation of regular reinforcement prior to extinction since
this would make the transition between recent training and
extinction abrupt. This was examined for the case of the free
operant response by Keller (1940), by Likely (1958), and by
Quatermain and Vaughan (1961). Although in every case the results
were that the interpolation of regular reinforcement failed to
reduce resistance to extinction, no firm conclusions can be drawn
since none of these experiments demonstrated a clear PRE for
partial versus regular reinforcement taken separately. Theios
(1962) remedied this deficiency in a runway experiment. The PRE
was not significantly altered by the interpolation of 5 regular
reinforcements, and it was still clearly present, although
significantly reduced, after 70 regular reinforcements.

The present experiments use the pigeon’s key peck response with a
discrete trial procedure. They were carried out independently of
Theios’ work and afford some of the same comparisons. In Exp. I,
the location of a period of partial reinforcement within a longer
regime of training under regular reinforcement was varied so that
the effect of the amount of regular reinforcement which occurred
between partial reinforcement and extinction could be observed
where the total amount of training was the same. A baseline was
provided by a control group which received all regular
reinforcement. The results suggested that the interpolation of
regular reinforcement prior to extinction may increase resistance
to extinction over the level obtained when extinction occurs
directly following partial reinforcement. Experiment II was
designed to substantiate this finding, to further define the
conditions under which it occurs, and to remove certain
ambiguities in the interpretation of Exp. I.


METHOD 


Subjects.—The Ss were 5—6 yr. old, male 
White Carneaux pigeons without prior ex- 
perimental history. They were maintained 
by restricted feeding at 80% of their free- 
feeding body weight. 

Apparatus.—An automatic key pecking 
apparatus of Skinner design was used. De- 
tails of the apparatus have been reported 
previously (Jenkins, 1961). Reinforcement 
was a 4-sec. period of access to a tray of mixed 
grain signaled by lighting the opening to the 
tray. 

Trials.—A trial was begun by lighting the translucent plastic
response key. The trial was terminated (key light turned off) by
a single response, or by external control at the end of 5 sec.,
whichever occurred first. On reinforced trials, the tray
operation followed the response without delay. During the
interval between trials the key light was off but S’s compartment
remained illuminated. The time between onsets of successive
trials was equally often 15, 30, or 45 sec. in an irregular
order. A response made between trials prevented the onset of the
next trial for 60 sec.

A second response occasionally occurred closely following the
first response to a trial. On trials in which the first response
initiated an operation of the grain tray, the second response
released the tray and thus prevented reinforcement. The second
response always placed the 60-sec. delay into effect.

Two measures were recorded on each trial: (a) the occurrence or
nonoccurrence of a response within the 5-sec. trial period, and
(b) the latency of response in tenths of a second. No latency was
entered for trials on which S failed to respond. Intertrial
responses (ITRs) were also recorded.

Shaping of response.—The Ss were trained to strike the lighted
key by reinforcing successive approximations. When several
automatically reinforced key responses had occurred, intertrial
intervals were introduced. A discrimination between the lighted
and unlighted key was formed immediately so that responses were
confined almost entirely to the trials from the beginning of
training. Two sessions were, with a few exceptions, sufficient to
prepare Ss for the automatically programed training.
Sessions.—The experiments involved preliminary training,
training, and extinction. Preliminary training consisted of four
sessions (Exp. I) or five sessions (Exp. II) of regular
reinforcement. In each session, 40 reinforced trials were
programed. The period of access to the grain tray was reduced
from 5 to 4 sec. in the course of these sessions.

Training sessions were under regular or partial reinforcement. In
sessions of regular reinforcement 40 reinforced trials were
programed as in preliminary training. Sessions of partial
reinforcement consisted of 40 reinforced trials plus a number of
nonreinforced trials programed in a random sequence subject to
the constraint that each half of the sequence contained half of
the reinforced trials (20) and half of the nonreinforced trials.

Extinction sessions were run on consecutive days and each
consisted of 40 nonreinforced trials. By prior decision, all
comparisons among groups were based on the first 10 extinction
sessions. Additional sessions were run in the case of most groups
in order to more nearly complete extinction.
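
The constrained random sequence used for sessions of partial reinforcement can be sketched as below. The original programing equipment is not described, so this is only one way to satisfy the stated constraint; the function name `partial_session` and its parameters are hypothetical.

```python
import random


def partial_session(n_reinforced=40, n_nonreinforced=40, seed=None):
    """Return a trial sequence (True = reinforced, False = nonreinforced)
    in which each half of the session contains half of the reinforced
    trials and half of the nonreinforced trials."""
    if n_reinforced % 2 or n_nonreinforced % 2:
        raise ValueError("trial counts must be even to split into halves")
    rng = random.Random(seed)
    sequence = []
    for _ in range(2):
        # Build one half with the required counts, then shuffle it.
        half = ([True] * (n_reinforced // 2)
                + [False] * (n_nonreinforced // 2))
        rng.shuffle(half)
        sequence.extend(half)
    return sequence


if __name__ == "__main__":
    session = partial_session(40, 60, seed=1)
    midpoint = len(session) // 2
    print(len(session), "trials")
    print(sum(session[:midpoint]), "reinforced trials in first half")
    print(sum(session[midpoint:]), "reinforced trials in second half")
```

Shuffling each half separately keeps the reinforced trials evenly split between the halves while leaving the order within each half random.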
Treatment groups, Exp. I.—The four groups of Exp. I each received
13 sessions of training beyond preliminary training. Group 13R
received 13 sessions of regular reinforcement. The remaining
three groups received partial reinforcement for a block of 3
consecutive sessions located in different positions within the
sequence of training sessions. These groups are designated 3P-10R
(3 sessions of partial reinforcement followed by 10 sessions of
regular reinforcement), 9R-3P-1R, and 10R-3P. Forty nonreinforced
trials were programed in the first session of partial
reinforcement, 60 in the second, and 80 in the third.

Five Ss were assigned to each group so as 
to match the groups as well as possible on the 
basis of the means and variance of the latency 
of response during preliminary training. 

Treatment groups, Exp. II.—Three groups received different
numbers of training sessions under partial reinforcement: Groups
8P (N = 9), 20P (N = 7), and 32P (N = 7). Two groups received
different amounts of training under regular reinforcement: Groups
8R (N = 6) and 20R (N = 6). Finally, two groups received
different amounts of training under partial reinforcement prior
to 12 sessions of regular reinforcement: Groups 8P-12R
(N = [illegible]) and 20P-12R (N = 6). The reasons for selecting
these conditions will become clearer after the results of Exp. I
are reported, but certain features may be noted here. Partial
reinforcement was begun at the same point in training (directly
following preliminary training), rather than at different points
as in Exp. I. The amount of training was extended in order to
obtain a clear separation in resistance to extinction between
groups given partial reinforcement as against those given regular
reinforcement. The design also allows comparison of the effect on
extinction of adding, after different amounts of training under
partial reinforcement, either more partial reinforcement, or an
equal number of sessions of regular reinforcement. The groups in
Exp. II were run in the following order: Groups 8P and 8R
concurrently, then Groups 20P, 20R, and 8P-12R concurrently, then
Groups 32P and 20P-12R concurrently. Groups run at the same time
were matched on the basis of performance in preliminary training
as in Exp. I.

In each of the first two sessions of partial reinforcement, 40
nonreinforced trials were programed. Thereafter, each session of
partial reinforcement contained 80 nonreinforced trials.

Additional sessions of extinction.—All groups in Exp. I and
Groups 20R, 20P, and 8P-12R of Exp. II received 2 additional
sessions of extinction. Groups 20P-12R and 32P received 4
additional sessions of extinction. In Group 32P 1 S was given 5
and another was given 11 sessions beyond Session 10. With the
additional sessions of extinction, there was at least 1 session
for every S in which 10 or fewer responses were made. The effect
of including responses made in sessions beyond Session 10 on the
relative standing of the groups as determined by the first 10
sessions was examined and is reported below.


RESULTS 


Responses in training.—The pro- 
portion of trials responded to was 
maintained at very high levels 
throughout all phases of training. 
The mean proportion based on both 
experiments was .976 for preliminary 
training and .985 for training. There 
were no systematic differences in this 
measure among groups or between 
the periods of partial reinforcement 
and regular within 
groups. 

Shape of extinction curves. 
trial method of the 


reinforcement 


The 


discrete present 


RESPONSES 


GROUP 20R 
(LEFT SCALE) 
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experiments yields extinction curves 
which typically show a sharp drop 
from a high to a low level of respond- 
ing. Individual extinction curves are 
shown for Ss in Groups 20R and 20P 
in Fig. 1. Although the average 
number of responses in extinction was 
about three times greater in Group 
20P than in Group 20R, it appears 
from Fig. 1 that the curves differ 
principally in the points at which 
failures to respond begin, rather than 
in the abruptness of the decline
thereafter.


In order to examine the abruptness of extinction more systematically,
two criterion sessions designated A and B were located for each S.
Session A was the last one in which 30 or more responses were made and
Session B was the first one in which 10 or fewer were made. Figure 2
plots for each group in Exp. II the mean locations (session numbers) of
the criterion sessions A and B. On the ordinate is plotted the mean number
of responses in the criterion sessions. The slope of the lines connecting
A with B thus represents, on the average, the abruptness of the extinction
curves. The slopes are very similar. The SD of the location of the
criterion sessions increases as the mean number of extinction sessions
prior to the "break point" increases, a trend which is correlated with
partial reinforcement in training. On the other hand, variability in the
number of sessions between A and B shows no systematic differences among
the groups.

Since the shapes of the extinction curves are similar for the several
groups, the major effects of the training variables can be represented
conveniently by the mean number of responses in extinction.

FIG. 1. Responses in extinction for individual Ss of Groups 20P and 20R.
(Abscissa: sessions; ordinate: responses, with Group 20R on the left scale
and Group 20P on the right scale.)

FIG. 2. The average slopes of extinction curves for the groups of Exp. II.
(The criterion sessions A and B are explained in the text. The bars
represent one SD of the location of Sessions A and B, or one-half SD to
either side of the mean. The numbers in parentheses are one SD of the
number of sessions between Sessions A and B.)

Responses in extinction, Exp. I.—Summary data for responses in 10
sessions of extinction are given for Exp. I and II in Table 1.¹ On the
assumption that PRE is reduced as a function of the amount of regular
reinforcement which is interpolated prior to extinction, the groups of
Exp. I would be ordered on the basis of number of responses in extinction
as follows: 10R-3P > 9R-3P-1R > 3P-10R > 13R. However, the groups which
received partial reinforcement actually show the reverse ordering: i.e.,
10R-3P < 9R-3P-1R < 3P-10R. An analysis of variance for these groups
yielded an F = 3.40 with 2/12 df, and P < .10. The mean difference
between the extreme groups was 78.4 responses (t = 2.60; df = 8,
P < .05). Group 13R, which received no partial reinforcement, had the
lowest mean number of responses in extinction. However, meaningful
statistical comparisons of this group with the others are difficult to
make since 1 S in Group 13R resumed responding late in extinction and thus
made a total number of responses exceeded by only 2 other Ss in the entire
experiment. The remaining 4 Ss in Group 13R had the four lowest totals in
the experiment.

¹ The inclusion of responses which occurred in sessions of extinction
beyond the tenth session resulted in only minor increases over the totals
given in Table 2 except in the case of Groups 20P and 32P, where the
totals increase about 10%. Statistical tests involving Groups 20P and 32P
were recomputed using the total of responses for all extinction sessions.
The significance levels in each case remained unchanged from those
obtained on the basis of the first 10 sessions of extinction.
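The criterion sessions A and B used in the analysis of curve shape can be located mechanically from a subject's session-by-session response counts. The following is a minimal sketch under stated assumptions: the function name, its arguments, and the example counts are hypothetical and do not come from the experiments; only the two criteria (30 or more responses, 10 or fewer responses) are taken from the text.

```python
from typing import Optional, Sequence, Tuple


def criterion_sessions(responses: Sequence[int],
                       high: int = 30,
                       low: int = 10) -> Tuple[Optional[int], Optional[int]]:
    """Return 1-based session numbers (A, B) for one subject.

    A is the last session with `high` or more responses.
    B is the first session with `low` or fewer responses.
    Either value is None when no session meets its criterion.
    """
    session_a = None
    session_b = None
    for number, count in enumerate(responses, start=1):
        if count >= high:
            session_a = number  # overwritten each time, so the last one is kept
        if count <= low and session_b is None:
            session_b = number  # set once, so the first one is kept
    return session_a, session_b


# Hypothetical subject showing a sharp drop from high to low responding.
example_counts = [40, 39, 38, 31, 12, 4, 0, 0, 0, 0]
a, b = criterion_sessions(example_counts)
```

With these made-up counts, Session A is Session 4 and Session B is Session 6, so the number of sessions between A and B serves as a simple index of how abrupt the decline was.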

The following hypothesis was formulated from the outcome of Exp. I
and tested in Exp. II: The interpolation of regular reinforcement between
partial reinforcement and extinction does not reduce the PRE, but in fact
increases resistance to extinction over that which obtains when extinction
follows partial reinforcement directly. Experiment I falls short of
establishing this point on two counts. First, a clear PRE was not obtained
when extinction followed partial reinforcement directly (i.e., Groups
10R-3P and 13R were not well separated).
In order to obtain a clear PRE against 
which to evaluate the effect of inter- 
polated regular reinforcement, the 
amount of training under regular or 
partial reinforcement was extended in 
Exp. II. Second, although the hy- 
pothesis ascribes the greater resistance 
to extinction in Group 3P-10R com- 
pared to Group 10R-3P to the regular 
reinforcement which followed partial 
reinforcement, the difference between 
these groups could also be ascribed to 
differences in the amount of regular 
reinforcement which preceded the 
partial reinforcement. In Exp. II, 
partial reinforcement was introduced 
at the same point in training in order 
to avoid this ambiguity. 

Responses in extinction, Exp. II.—A comparison of Groups 8P, 20P, and
32P of Exp. II shows that resistance to extinction was a function of the
amount of training under partial reinforcement (F = 12.18; df = 2/12,
P < .001). The extension of training from 8 to 20 sessions resulted in a
significant increase in resistance to extinction (t = 4.16; df = 14,
P < .001), but no further increase resulted from the further extension of
training from 20 to 32 sessions (t < 1).
TABLE 1

MEANS AND SDs OF RESPONSES IN 10 SESSIONS OF EXTINCTION: Exp. I AND II

Type of Training        Experiment I             Experiment II
                        Group         Mean       Group

Regular                 13R           100.0      8R
                                                 20R

Partial                 10R-3P        118.4      8P
                                                 20P
                                                 32P

Partial then Regular    9R-3P-1R                 8P-12R
                        3P-10R                   20P-12R

A comparison of Groups 8R and
20R shows, on the other hand, that 
when training was under all regular 
reinforcement, resistance to extinction 
was unchanged by the extension of 
training from 8 to 20 sessions (t < 1). 

A clear PRE emerged only after extended training. Groups 8R and 8P
did not differ significantly (t = 1.64; df = 13, P < .20) while Groups
20R and 20P were clearly different (t = 7.60; df = 11, P < .001).

Consider next Group 8P-12R which 
was switched from partial to regular 
reinforcement prior to extinction. 
Resistance to extinction for this group was significantly greater than
for Group 8P (t = 2.91; df = 14, P < .02), showing that the addition of
regular reinforcement increased resistance to extinction. The increase
obtained from adding 12 sessions of regular reinforcement to 8 sessions of
partial reinforcement was less, but not significantly less, than the
increase resulting from the addition of 12 sessions of partial
reinforcement; i.e., the means for the number of responses in extinction
in Groups 8P-12R and 20P could not be reliably distinguished
(t = 1.79; df = 12, P = .10).

When 20 sessions of partial reinforcement were given, the addition of
12 sessions of regular reinforcement (Group 20P-12R) produced no further
increase in resistance to extinction. The mean number of responses to
extinction was in fact less, although not significantly less, in Group
20P-12R than in either Groups 20P or 32P (20P-12R vs. 20P: t = 1.42,
df = 11, P < .20; 20P-12R vs. 32P: t = 1.30, df = 11, P < .20).

The results of Exp. I and II are brought together in Fig. 3 which plots
the mean number of responses in extinction as a function of the amount of
training under regular reinforcement, the amount of training under partial
reinforcement, and for the mixed
conditions in which partial reinforce- 
ment is followed by regular reinforce- 
ment. Experiment I has been co- 
ordinated to this plot by ignoring 
differences in the amount of regular 
reinforcement which preceded the 
introduction of partial reinforcement. 
When treated in this way, the out- 
come of Exp. I falls into line quite 
well with that of Exp. II, so that it 
seems reasonable to believe that varia- 
tions in the amount of training under 
regular reinforcement preceding the 
introduction of partial reinforcement 
made little difference. 

The independent variables in Fig. 3 
are the number of training sessions of 
regular or partial reinforcement be- 
yond the sessions of preliminary 
training. Thus, the dotted lines 
extrapolated to zero sessions of train- 
ing represent a guess that preliminary 
training alone would produce a resist- 
ance to extinction equal to that ob- 
tained after eight additional sessions 
of regular reinforcement. 

Relation of latency in training to 
responses in extinction.—The amount 
and type of training had only minor 
effects on the mean latency of response 
for groups. There were in fact no 
significant differences in the mean 
latencies of response taken over the 
last three sessions of training in either 
experiment. The overall mean la- 
tency for these sessions was 1.4 sec. 
However, it was possible to detect in 
Exp. II a small effect of partial 
reinforcement when compared to regu- 
lar reinforcement for the first eight 
training sessions. The mean latency 
for partially reinforced Ss (N = 36) 
was .2 sec. longer than for regularly 
reinforced Ss (N = 12). A test of this 
difference using a covariance analysis 
with latency during preliminary training as the predictor yielded
t = 2.34; df = 45, P < .05. The switch from partial to regular
reinforcement caused no significant change in latency in either
experiment. From the overall results it is apparent that the effects of
the amount and type of training on resistance to extinction were not
paralleled by effects on mean group latencies of response in training.

FIG. 3. Mean number of responses in extinction for all groups (Exp. I and
II) as a function of the amount and type of training beyond preliminary
training. (Sessions of regular training preceding partial training in
Exp. I have been disregarded and are shown in parentheses [see text].)

The correlation between latencies for individuals within groups and
responses to extinction is more interesting since it provides a clue as to
how the addition of training under regular reinforcement after a limited
amount of training under partial reinforcement operates to increase
resistance to extinction. A substantial negative correlation (longer
latencies associated with fewer responses in extinction) was found in the
case of two groups: 10R-3P (Exp. I) and 8P (Exp. II). These were the
only groups which went directly to extinction after a training period
under partial reinforcement that was
not sufficiently extensive to produce a
maximum resistance to extinction. 
In Group 10R-3P the correlation was -.79, and in Group 8P it was also
-.79. In the first case the correlation was based on 5 Ss and was not
significant while in the latter case it was based on 9 Ss and was
significant (t = 3.35; df = 7, P < .02). In all other groups, the
correlations were low and did not approach significance.
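The significance statements for the two correlations can be checked with the conventional t test for a correlation coefficient, t = r√(n − 2) / √(1 − r²) with df = n − 2. The sketch below assumes that this is the test that was used, which the text does not state explicitly (the reported df = 7 for 9 Ss is consistent with it); the function name is hypothetical.

```python
import math


def t_from_r(r: float, n: int) -> float:
    """t statistic for testing a Pearson correlation against zero (df = n - 2)."""
    return r * math.sqrt(n - 2) / math.sqrt(1.0 - r * r)


t_8p = t_from_r(-0.79, 9)      # Group 8P: 9 Ss, df = 7
t_10r3p = t_from_r(-0.79, 5)   # Group 10R-3P: 5 Ss, df = 3
```

With the rounded value r = -.79, the statistic for Group 8P comes out near 3.4 in absolute value, close to the reported 3.35 (the small gap would be expected if the unrounded correlation was slightly smaller in magnitude). The same correlation based on only 5 Ss gives a value near 2.2, which illustrates why an identical r was not significant in Group 10R-3P.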


The pattern of correlations reflects 
the following state of affairs. When 
extinction was begun immediately 
after a relatively short period of 
training under partial reinforcement, 
slow responders extinguished more 
rapidly than fast responders. When, 
on the other hand, a period of regular 
reinforcement was added, resistance 
to extinction for slow responders was 
at the same average level as for fast 
responders. It is important to note 
that the effect of adding regular 
reinforcement was not to alter the 
mean latency of response, but rather 
to remove the correlation between 
latency and resistance to extinction. 
The extension of training under
partial reinforcement also removed
the correlation in the same way.


DISCUSSION 


The abruptness of the transition from 
training to extinction is obviously not the 
critical functional difference between 
regular and partial reinforcement. When 
extensive training is first given under 
partial reinforcement, its effect persists 
through a period of training under
regular reinforcement involving 480 rein- 
forcements. 

Although 480 reinforcements is a large 
number, it is still a small fraction of the 
2,320 training trials under partial rein- 
forcement which were used in Group 
20P in order to obtain a clear PRE. It 
therefore seems possible that the insignificantly
lower resistance to extinction
for Group 20P-12R in comparison
with that of Group 20P or 32P would
become a reliable difference with more 
extensive regular training. The point is 
important in connection with one inter- 
pretation of how overtraining in a dis- 
crimination task facilitates the learning 
of a reversed discrimination (Birch, Ison, 
& Sperling, 1960; Capaldi & Stevenson, 
1957). 

Interpolating a period of regular rein- 
forcement between partial reinforcement 
and extinction can increase resistance to 
extinction over the level which obtains 
when extinction follows partial reinforcement directly. The results
suggest the rule that added training under regular reinforcement increases
resistance to extinction only if added training under partial
reinforcement would also increase resistance to extinction. It appears
that Ss with relatively long latencies of response during the first
segment of partial training gain most in resistance to extinction from the
subsequent reinforcement.

This unanticipated result requires a 
reformulation. Instead of thinking of 
the partial training as producing a PRE 
which might or might not be attenuated 
by subsequent regular reinforcement, it 
now appears that the PRE can be gener- 
ated by the training conditions in Groups 
3P-10R and 8P-12R. The present interpre- 
tation of the PRE in these groups is that 
it arose from an interaction of the effects of the nonreinforced trials
during the partial training with the subsequent reinforcement during
regular training. It is clear that the effect depends upon an interaction
since without a prior exposure to nonreinforced trials, resistance to
extinction was unchanged by the extension of training under regular
reinforcement. Amsel's (1958) account of the PRE, which holds that the
combination of conditioned frustration due to nonreinforcement and
subsequent reinforcement develops a tolerance for nonreinforcement,
provides one conjecture as to how this interaction is mediated.²


² Results on the occurrence of ITRs suggest that frustration was
associated with extinction. During training, the overall mean frequency of
ITRs was .5 per 100 trials. In extinction it increased in every group
yielding an overall mean of 2.2 per 100 trials. The mean frequency of ITRs
showed a peak value in the vicinity of the decline of responses in
extinction.

Other results (Weinstock, 1954, 1958) indicate that some form of
interaction between nonreinforcement and reinforcement can occur across
long time intervals (at least 24 hr. separated the partial from the
regular training in the present experiments), and even when the
reinforcements occur consecutively in a block (Jensen & Cotton, 1960;
Lauer & Carterette, 1957) as they did in the regular training of the
present experiments.

The present results agree with those of Keller (1940), Likely (1958),
Quartermain and Vaughan (1961), and more specifically with Theios (1962),
in demonstrating the persistence of the PRE through a period of regularly
reinforced training. Further, Theios' finding of a small but significant
reduction in the PRE for the group which received more extensive regular
reinforcement (Group P-70) is paralleled by the present finding of a
lessened resistance to extinction in Group 20P-12R.

The fact of increased resistance to extinction for the partial-regular
training sequence is a departure, although indications of it can be found
in previous results. In particular, Theios obtained a small increase in
resistance to extinction (not significant) when 25 regular reinforcements
were added to previous partial training. He considered the possibility
that it resulted from the regular reinforcement, but rejected it on the
grounds that no increase occurred when regular reinforcement was added to
previous regular training. However, the present results show that when the
effect is obtained, it arises from an interaction of partial and regular
reinforcement; a possibility which is overlooked in Theios' argument.

Two implications of the present results for theory may be noted. First,
the increased resistance to extinction produced by the partial-regular
sequence
poses a problem for the dissonance 
theory of the PRE (Lawrence & Fest- 
inger, 1962) since that theory holds that 
the processes producing the increased 
resistance to extinction occur primarily 
on the nonreinforced trial itself. Second, the remarkable degree of
independence between the slope of the extinction curves and the total
number of responses in extinction indicates that something like a
threshold model of extinction is prefer- 
able, at least for conditioning procedures 
of the type used here, to a decremental 
model in which an initial probability of 
response is reduced on each trial by an 
amount proportional to the number of 
responses remaining. Models of the 
latter type lead to exponential decays in 
which the slope of the curve is correlated 
with the total number of responses in 
the curve. 
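The link between slope and total responses in a decremental model can be illustrated numerically. The sketch below is one simple reading of such a model, not the authors' formulation: it works at the level of sessions rather than single trials, and the function names, the `rate` parameter, and the totals of 100 and 300 responses are all hypothetical.

```python
from typing import List


def decremental_curve(total_responses: float, rate: float, sessions: int) -> List[float]:
    """Expected responses per session when each session uses up a fixed
    fraction `rate` of the responses still remaining."""
    remaining = total_responses
    curve = []
    for _ in range(sessions):
        emitted = rate * remaining  # decrement proportional to responses remaining
        curve.append(emitted)
        remaining -= emitted
    return curve


def first_drop(curve: List[float]) -> float:
    """Decline from the first session to the second, used as a simple slope index."""
    return curve[0] - curve[1]


small = decremental_curve(100.0, 0.5, 10)
large = decremental_curve(300.0, 0.5, 10)
```

Under these assumptions the curve is a geometric (discrete exponential) decay, and tripling the total number of responses triples the session-to-session drop. The extinction data described above show the opposite pattern: groups with very different totals had similar slopes between Sessions A and B.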


SUMMARY 


A food reinforced, key peck response in the pigeon was used with a
discrete trial procedure to study resistance to extinction (number of
responses in 10 40-trial sessions) following different amounts of training
of three types: partial reinforcement, regular reinforcement, and partial
followed by regular reinforcement.

Resistance to extinction showed no systematic change as a function of the
amount of regularly reinforced training. On the other hand, it increased
as a function of the amount of partial training up to 20 sessions and then
leveled off at a value which clearly demonstrated the PRE. Resistance to
extinction after training under the partial-regular sequence was never
significantly lower than for partial alone. In fact, the addition of
regular reinforcement increased resistance to extinction when the amount
of prior training under partial was not sufficiently extensive to produce
a maximum.

From these results and those of previous experiments the conclusion is
well established that the abruptness of the local transition between
training and extinction is not critically involved in the PRE.


PARTIAL REINFORCEMENT, CONTINUOUS REINFORCEMENT, AND REINFORCEMENT
SHIFT EFFECTS¹


STEWART H. HULSE


Johns Hopkins University 


Hulse (1962) has shown that partial 
reinforcement (PRF) of running in a 
straight alley produces sharp dis- 
crimination of reward stimuli in the 
goal box, but continuous reinforcement (CRF) does not. Partially
reinforced rats learned to lick from a drinking tube much faster than
continuously reinforced rats, and the number of licks they emitted on
nonreinforced running trials rapidly decreased to a very low level. Taken
as a whole, the data indicated that behavior in the goal box was much
more critically focused on the reward and its stimulus properties if PRF
as compared with CRF were used.


There is an important implication of these experimental results. During
the course of training with PRF, changes in the stimulus properties of
the reward ought to be sharply discriminated. For example, on the
assumptions that sweetness is a valid stimulus continuum for a reinforcing
substance (Guttman, 1953) and response rate is positively correlated with
sweetness, shifts from low to high concentrations of a saccharin
reinforcer ought to produce prompt and direct increases in response rate.
Discrimination of a stimulus change of this sort, and appropriate changes
in response rate, ought to occur to a much lesser degree, if at all, with
CRF. This would be true since CRF does not appear to bring behavior under
the stimulus control of the reinforcer to the extent that PRF does.

¹ The research reported in this paper was supported by National Science
Foundation Research Grants G8712 and G18125. The author is indebted to
M. A. Berkley and E. J. Appelman for collecting the data.

The data reported here constitute 
a test of this implication. In one 
experiment, rats were conditioned on 
low and high concentrations of sac- 
charin with PRF and CRF. Then, 
for half the rats, reinforcement con- 
centrations were switched from low to 
high or from high to low. In a second
experiment, essentially the same pro- 
cedure was used, except that rats 
were switched only from low to high 
concentrations, and more extreme 
differences in concentration were used. 
Instrumentally conditioned licking
was used as a response. This response 
is particularly appropriate, since the 
tongue is intimately involved in the 
sensory dimension of taste, and since 
licking rates vary quite consistently 
with the concentration of a sweet 
reinforcing stimulus (Hulse & Bacon, 
1962). 
EXPERIMENT I
Method 


Subjects.—The Ss were 64 naive male albino rats, 85 days of age, of the
Sprague-Dawley strain obtained from Sprague-Dawley, Incorporated, Madison,
Wisconsin. The Ss had no experience drinking from tubes prior to their use
in the experiment.

Apparatus and procedure.—The apparatus 
and the general procedure for treating licking 
as an instrumental response have been de- 
scribed in detail elsewhere (Hulse, 1960; 
Hulse, Snyder, & Bacon, 1960). In brief, the 
apparatus consisted of a 6.5 × 10 × 7 in. wooden box which was suitably
lighted, ventilated, and sound-shielded. A piece of j-in. Plexiglas was
located in front of a hole in one wall of the box. The Plexiglas had a
1 × 2 cm. vertical slot cut into it. The S had access through the slot to
a drinking tube. Each lick on the drinking tube operated an electronic
relay which operated, in turn, a programing circuit. Liquid reinforcements
of a specified volume were delivered to S through the tube by means of an
infusion pump operated by the programmer. For partial schedules of
reinforcement, the programmer also operated a solenoid-driven lever. The
lever slightly squeezed the pressure tubing leading from the pump to the
drinking tube. After the reinforcement was delivered, the tubing returned
to its normal shape and produced a slight negative pressure in the fluid
system. The negative pressure drew the fluid column about 1 mm. inside the
tip of the drinking tube. This assured that S could not make contact with
the fluid on nonreinforced licks.

Two drinking tubes were used in the experiment. During the first 4 days
of training a brass tube with a 2-mm. fluid hole was used. On Day 1, this
tube projected through the slot in the experimental box; on Days 2 to 4,
the tube was gradually withdrawn through the slot. On Day 5, a plastic
nipple with a
mm. fluid hole was introduced behind the slot and used for the rest of
the experiment. This tube had a ;¢-in. diameter brass electrical contact
located just below the fluid hole.

The experimental design that was used during initial training
incorporated two schedules of reinforcement (CRF and FR8) and two
concentrations of reinforcement (1 gm. and 10 gm. of saccharin added to
1 l. of tap water). Sixteen Ss were randomly assigned to each of the four
groups called for by this design. After initial training, the
concentrations of reinforcement were switched for half the Ss trained with
each schedule of reinforcement. For both the CRF and FR8 schedules of
reinforcement, the postswitch design thus included one group switched from
the low to the high concentration, one group switched from high to low,
and two control groups which continued on their initial concentrations
(N = 8 in each group).

Upon receipt from the breeder, Ss were put in colony cages and
maintained on ad lib. Purina lab chow and water (available from cans) for
10 days. Ten days of taming followed. On Day 1 of taming, Ss were
transferred to individual cages and placed on a daily deprivation diet of
10 gm. ground Purina lab chow mixed with 20 cc of tap water. They stayed
on this deprivation schedule throughout the remainder of the experiment.
Also, on each taming day, Ss
were taken in groups of approximately 15,
freely handled, and allowed to explore in a 
large wooden box for 30 min. They received 
their daily ration of wet mash in their home 
cages after this procedure was completed. 
Following taming, all Ss received 22 days 
of initial conditioning of the instrumental 
licking response. On Day 1, each S was 
placed in the apparatus and permitted 300 
licks from the brass tube on CRF. The concentration of reinforcement
variable was introduced immediately. The size of each reinforcement was
.005 cc of fluid. The plastic nipple was introduced on Day 5. The number
of licks permitted all Ss and the ratio of reinforcement for the partial
Ss were gradually increased so that by Day 7, all Ss made 1,000 licks, and
by Day 18, the partial Ss were on their final FR8. This regime was
continued through Day 22.


On Day 23, the concentration of saccharin 
for Ss in the switched groups was shifted from 
high to low or from low to high, depending 
upon the concentration that was used during 
initial training. This regime was continued 
from Day 23 through Day 29. On Day 30, 
Ss in the switched groups were shifted back 
to their original concentrations and run under this regime through
Day 36. The Ss in the control groups remained on their initial
concentrations throughout the 36 days of training.

On Day 37, 3 days of extinction began. Each day, Ss were placed in the
apparatus and allowed to lick the dry tube for 3 min. All conditions
remained the same as those prevailing during training, except that the
pump and solenoid-driven lever were disconnected from the programmer. The
Ss were fed their daily ration of wet mash 20 to 30 min. after they had
been run. The amount of water added to the mash was adjusted to account
for the fluid obtained in the apparatus.

During training, the total time that each S required to emit its
1,000-lick allotment was recorded for each day. These times were
transformed to reciprocals and multiplied by 1,000 to give a measure of
average licking rate in licks per second. During extinction, the total
number of licks that each S emitted during the daily 3-min. extinction
period was recorded.
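The rate transformation described above amounts to dividing the 1,000-lick allotment by the time taken to complete it. A minimal sketch follows; the function name and the example time of 250 sec. are hypothetical and are not values from the experiment.

```python
def licking_rate(total_time_sec: float, licks: int = 1000) -> float:
    """Average licking rate in licks per second.

    Follows the transformation in the text: take the reciprocal of the
    total time and multiply it by the number of licks in the allotment.
    """
    if total_time_sec <= 0:
        raise ValueError("total time must be positive")
    return (1.0 / total_time_sec) * licks


# Hypothetical S that needed 250 sec. to emit its 1,000 licks.
rate = licking_rate(250.0)
```

For this made-up case the rate is 4 licks per second. Because the measure is a reciprocal of time, a S that takes twice as long receives half the rate score.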
Results

Preswitch performance.—By Day 22, the last training day before the
first switch in saccharin concentration, CRF produced faster licking rates
than PRF, and the high concentration of saccharin produced faster licking
rates than the low concentration of saccharin. An analysis of variance for
Day 22 for Ratio and Concentration of Reinforcement yielded an F for Ratio
of 15.83 (df = 1/60, P < .01) and an F for Concentration of 59.11
(df = 1/60, P < .01). The interaction between the two variables was not
significant (P > .05).

First switch.—The effect of the first switch in concentration, which
occurred on Day 23, was quite different for CRF and PRF. Figure 1 shows
that with PRF an orderly and appropriate change in rate occurred from
Day 22 to Day 23. If the switch was from a high to a low concentration,
licking rate decreased. If the switch was from low to high, licking rate
increased. With CRF, on the other hand (Fig. 2), licking rates decreased
regardless of the direction of the switch in concentration.

FIG. 1. Effect of shifts in concentration of reinforcement for PRF in
Exp. I. (Shifts occurred between Days 22 and 23 and between Days 29
and 30.)

FIG. 2. Effect of shifts in concentration of reinforcement for CRF in
Exp. I. (Shifts occurred between Days 22 and 23 and between Days 29
and 30.)

TABLE 1

ANALYSES OF VARIANCE OF SWITCHED-GROUP LICKING RATES DURING TRAINING

Treatments (T)    0.21
Error             0.70
Days (D)          0.01
Ss × D            0.30
T × D             2.64**
Error             0.13

0.84
0.75
0.05
0.26
2.65**
0.09

** P < .01.

TABLE 2

ANALYSES OF VARIANCE OF LICKING RATES DURING TRAINING FOR PRF GROUPS

Source               Day 22     Day 23    Day 29    Day 30    Day 36

Concentration (C)    12.50**    7.70**    6.57**    7.22**    5.36**
Switch (Sw)           0.01      0.38      0.94      0.67      3.06*
C × Sw                0.01      1.17      1.95*     0.36      0.43
Error                 0.43      0.54      0.34      0.42      0.62

* P < .05.
** P < .01.

Statistical analyses of these results were carried out in two ways.
First, analyses of variance were used on the means for groups that were
switched in concentration (Table 1). These analyses were done separately
for the CRF and PRF conditions, and they included pre- and postswitch
days as a variable. Second, analyses of variance were used on means for
both the control and switched groups (Tables 2 and 3). These were done
separately for both the CRF and PRF conditions and for Days 22 and 23.
Table 1 shows a highly significant Treatments × Days interaction for the
PRF condition. By t test, the licking rate of the PRF High-Switched group
decreases from Day 22 to Day
23 (t = 6.36, df = 14, P < .01), and
the licking rate of the PRF Low- 
Switched group increases from Day 22 
to Day 23 (t = 2.27, df = 14, P < .05). 
For the CRF condition, Table 1 shows 
that only Days is a significant vari- 
able. Licking rates on Day 23 are 
much lower than on Day 22, and this 
is true regardless of the direction of 
the switch in concentration. Tables 2 
and 3 show that for both PRF and 
CRF on Day 22 only the Concentra- 
tion variable is significant. High con- 
centrations produced faster licking 
rates than low concentrations. On 
Day 23, however, the pattern of 
significant effects is quite different for 
PRF and CRF. For PRF, the Con- 
centration variable once again pro- 
duces the only significant effect. 
Licking rate is correlated with the concentration used on Day 23 for both
switched and control groups. For CRF, only the Switch variable is
significant. Licking rates are generally lower for the switched groups
than they are for the control groups. By t tests, both the High-Switched
and the Low-Switched groups differ from the High-Control group
(t's = 2.72 and 2.83, df = 28, Ps < .05 and .01, respectively), but
neither switched group differs significantly from the Low-Control group
(Ps > .05).

TABLE 3

ANALYSES OF VARIANCE OF LICKING RATES DURING TRAINING FOR CRF GROUPS

Source               df

Concentration (C)    1     8.92**   0.   1.62
Switch (Sw)          1     0.16
C × Sw               1     0.31
Error                28    0.32

Day 30

4 Q7**
** 0.10 0.13
0.32 0.09
0.45 0.34

** P < .01.

Terminal performance on the con- 
centrations of the first switch was 
quite different for PRF and CRF. 
For PRF on Day 29 (Fig. 1), licking 
rates for the Low-Switched group 
reached about the same level as those 
of the High-Control group. Licking 
rates for the High-Switched group, 
however, showed an increase following 
Day 23 and reached an asymptote 
that was much higher than that of the 
Low-Control group. Table 2 shows
significant effects for both Concen-
tration and the interaction between
Concentration and Switch in Concen-
tration. By t test, the Low-Switched
group does not differ from the High-
Control group (P > .05), but the
High-Switched group licks signifi-
cantly faster than the Low-Control
group (t = 2.90, df = 28, P < .01).
The results for CRF (Fig. 2) show a 
pattern of differences among the 
groups which is similar to that for 
PRF, but the magnitudes of the 
differences are much smaller. Table 3 
shows that for CRF none of the ex- 
perimental variables produced signifi- 
cant effects on Day 29. 

2 Analyses of variance for the last 3 days
of the first and second switches revealed no
significant effects due to Days or the inter-
action of Days with any of the other variables.
The data thus provide no evidence that the
groups had not reached asymptotic perform-
ance by Day 29 or by Day 36.

Second switch.—With one major
exception, the effect of the second 
switch in concentration on Day 30 
was quite similar for both PRF and 
CRF. Figures 1 and 2 show that the 
licking rates of the switched groups 
changed in the appropriate direction 
when these groups returned to their 
original training concentrations. This 
was true regardless of ratio of rein- 
forcement. The extent of the change 
in rate, however, was quite different 
for PRF and CRF. With CRF, 
Fig. 2 shows that the switched groups 
reached licking rates which were es- 
sentially the same as the control 
groups. With PRF, however, Fig. 1 
shows that the Low-Switched group 
changed to a performance level which 
was considerably above that of the 
Low-Control group. By Day 36, the 
PRF High-Switched group also re-
sponded at a faster rate than the
High-Control group. 

Table 1 shows a significant Treat- 
ments X Days interaction for the 
switched groups for Days 29 and 30 
for both PRF and CRF. Tables 2 and 
3 show a significant Concentration 
effect on Day 30 for both PRF and 
CRF, but none of the other variables 
are significant. On Day 36, however, 
there is a significant Switch effect for 
PRF in addition to significant Con-
centration effects for both PRF and
CRF. Although both PRF switched 
groups performed above their controls 
on Day 36, the difference between the 
Low-Switched and the Low-Control 
group is the only one that reaches 
significance (t = 2.16, df = 28,
P < .05).

Extinction.—Figure 3 shows that,
in general, the PRF groups emitted
more licks on the dry drinking tube
than the CRF groups. Moreover,
with PRF, the switched groups were
far more resistant to extinction than
the nonswitched groups. With CRF,
however, the data suggest that the
switched groups were less resistant
to extinction than the nonswitched
groups.

FIG. 3. Resistance to extinction in Exp. I
for PRF and CRF Switched and Control
groups. (Ordinate: number of licks.)


An analysis of variance on mean 
number of licks for all groups across 
the 3 days of extinction shows signifi- 
cant effects for Ratio of Reinforce- 
ment (F = 70.48, df = 1/56, P < .01), 
Days of Extinction (F = 102.93, 
df = 2/126, P < .01), Switch in Con- 
centration (F = 5.78, df = 1/56, 
P < .05), Ratio X Switch (F = 9.10, 
df = 1/56, P < .01), and Ratio 
X Days (F = 41.46, df = 2/112,
P < .01). For PRF, the mean differ-
ence between the switched and non-
switched groups is significant (t = 6.61,
df = 56, P < .01). For CRF, this
difference is not significant (t = 0.76,
df = 56, P > .05).


EXPERIMENT II


Experiment II was run to check on
the generality of some of the phe-
nomena which resulted from the first
switch of Exp. I. Experiment II
incorporated more days of training
before the switch took place and used
more extreme differences in the con-
centration of the saccharin rein-
forcement.

Method


Subjects.—The Ss were 30 naive male albino rats,
approximately 75 days of age, of the Sprague-Dawley strain.

Apparatus and procedure.—The apparatus was identical to that
used in Exp. I. Similarly, the details of housing, taming, and
feeding of Ss were essentially the same as those of Exp. I. A
somewhat more severe daily deprivation regime was adopted: 9 gm.
of Purina chow mixed with 18 cc of tap water.

The experimental design that was used was identical to that of
Exp. I except that no High-Switched groups were run. The post-
switch design thus included six groups with an N of 5 in each
group. Reinforcement concentrations were either 50 gm. or 10 gm.
of saccharin added to 1 l. of tap water. As before, ratio of
reinforcement was either CRF or FR8. The size of each
reinforcement was .0043 cc of fluid.

Initial training in Exp. II was identical to that of Exp. I with
the following exceptions. Only 600 licks were permitted each day.
The PRF Ss were on their final FR8 schedule by Day 16, and all Ss
continued initial training through Day 32. The PRF Ss thus
received 17 days of training on FR8 instead of the 5 days that the
PRF Ss received in Exp. I. Following initial training,
concentration of reinforcement was changed from low to high on
Day 33 for the PRF and CRF Switched groups, and the Ss continued
on the new regime through Day 37.


Results 


The immediate effects of the switch from a low to a high
concentration are qualitatively exactly the same as those
obtained in Exp. I. Figure 4 shows that with PRF, the licking
rates of the Switched group increased markedly from Day 32 to
Day 33. Comparison of the data shown in Fig. 4 with those shown
in Fig. 1 suggests that the increase in rate was considerably
greater under the conditions of Exp. II than under those of
Exp. I. Figure 5 shows that with CRF, licking rates of the
Switched group declined, as they did in Exp. I.

FIG. 5. [Caption only partly legible: "... reinforcement for ...";
"... shift occurred between Days 32 and 33".]


An analysis of variance used on the group means of Days 32 and 33
for PRF shows the following effects to be significant: Treatments
(F = 14.76, df = 2/12, P < .01), Days (F = 4.66, df = 1/14,
P < .05), and Treatments X Days (F = 7.49, df = 2/12, P < .01).
The significant Treatments effect occurs primarily because of the
large and consistent difference in performance between the High-
Control and the Low-Control groups, while the significant
Treatments X Days effect reflects the sharp change in performance
of the Switched group from Day 32 to Day 33. An analysis of
variance for CRF shows Treatments to be the only significant
effect (F = 12.61, df = 2/12, P < .01). A third analysis of
variance, which compared the performance of the CRF and PRF
Switched groups on Days 32 and 33, shows a significant Days
X Ratio of Reinforcement interaction (F = 9.14, df = 1/8,
P < .05). This corroborates the results shown by the separate
analyses for PRF and CRF regarding the effects of the switch in
concentration.
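
The degrees of freedom quoted for these analyses can be checked against the design stated in the Method (an N of 5 in each group, two days per analysis). The sketch below is only an illustration: it assumes a conventional groups-by-days mixed layout, and the function name `mixed_design_df` is hypothetical. It reproduces the 2/12 and 1/8 values reported above; it is not meant to account for every df printed in the text.

```python
def mixed_design_df(n_groups: int, n_per_group: int, n_days: int) -> dict:
    """Degrees of freedom for a groups x days design (assumed layout).

    Groups is a between-subjects factor; Days is a repeated measure.
    Each entry is (effect df, error df).
    """
    subjects = n_groups * n_per_group
    between_error = subjects - n_groups            # Ss within groups
    within_error = between_error * (n_days - 1)    # Days x Ss within groups
    return {
        "groups": (n_groups - 1, between_error),
        "groups_x_days": ((n_groups - 1) * (n_days - 1), within_error),
    }


# One ratio of reinforcement: High-Control, Low-Control, Switched; 5 Ss each.
treatments = mixed_design_df(n_groups=3, n_per_group=5, n_days=2)
# CRF Switched vs. PRF Switched; 5 Ss each.
ratio = mixed_design_df(n_groups=2, n_per_group=5, n_days=2)

print(treatments["groups"])         # Treatments
print(treatments["groups_x_days"])  # Treatments X Days
print(ratio["groups_x_days"])       # Days X Ratio of Reinforcement
```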

Following the first day of the 
switch in concentration, the perform- 
ance of the CRF and PRF Switched 
groups was quite similar. Both 
groups reached a licking rate which 
was between the rates of the High- 
Control and Low-Control groups. 
An analysis of variance of means obtained by averaging rates on
Days 36 and 37 shows a significant effect due to Treatments
(F = 18.86, df = 2/22, P < .01) and to Ratio of Reinforcement
(F = 15.57, df = 1/22, P < .01). The interaction between these
two variables is not significant. By t test, the Switched groups
responded significantly faster than the Low-Control groups
(t = 3.62, df = 22, P < .01) and significantly slower than the
High-Control groups (t = 2.50, df = 22, P < .05).


DISCUSSION 


The licking behavior that occurs on the day of a sudden change in
the concentration of the reinforcer indicates that rats who are
trained with PRF are conditioned to respond to the reinforcing
stimulus in a way which is quite different from rats who are
trained with CRF. With PRF, the sweetness and other stimulus
properties of the reinforcer are sharply defined for S. This
happens because of the contrast in stimulus conditions which
occurs when responses are sometimes followed by a drop of
saccharin and sometimes followed by nothing. If the concentration
of the saccharin suddenly changes, the change is relatively easy
to detect, and response rate immediately increases or decreases in
the direction of the change. In effect, behavior comes under
critical and orderly control of the stimulus properties of the
reinforcer through discrimination training.

A different situation prevails when 
CRF is used. Here, each successive 
response is followed by the same sweet 
stimulus, and there is no stimulus con- 
trast from one response to the next. 
With CRF, S learns to attack the drink- 
ing tube in a way which is vigorous, but 
relatively speaking, blind. If the con- 
centration of the saccharin reinforcer 
changes, stimulus generalization decre- 
ment prevails, and behavior is markedly 
disrupted. Pure generalization decre- 
ment will hold the first time that S meets 
a sudden change in concentration, but 
not, of course, on later occasions. With 
the first change, S receives new informa- 
tion about reinforcement conditions and 
is, in fact, trained through contrast to 
discriminate somewhat more about the 
stimulus properties of the reinforcer. 
A second change in concentration comes 
as less of a surprise to S, and as Exp. I
shows, response rates immediately in- 
crease or decrease appropriately. 

The behavior that appears on the days which follow a change in
reinforcement concentration, and the behavior that appears during
extinction, indicate that experience with a particular set of
reinforcement conditions on PRF produces residual effects which
have a long-lasting influence on response strength. The
discrimination training that PRF provides apparently yields a
learned connection between licking behavior and a particular
stimulus property of the reinforcer, such as its sweetness.


First, licking rates for the PRF switched groups reach postshift
asymptotes which often lie between the licking rates that are
characteristic of the high and low concentrations of saccharin.
The Ss behave much as if they were averaging the intensities of
the pre- and postswitch reinforcing stimuli. Stimulus averaging of
this sort seems to be most easily demonstrated when concentrations
are changed from high to low (cf. Fig. 1). On the first postswitch
day of the second change in concentration of Exp. I, for example,
the response rate of the PRF Low-Switched group falls sharply. But
it remains well above the response rate of the Low-Control group.
In Exp. II, a compromise in rate is also obtained when
concentrations are changed from low to high. The PRF Switched
group increases its licking rate on the first postswitch day, but
then the rate falls to an asymptote which is again about half way
between the rates of the control groups. Sometimes, stimulus
averaging and compromises in response rate also appear with CRF.
But when they do, they are far less easily obtained than they are
with PRF. In Exp. I, there is no indication of a consistent
compromise in response rate for CRF. However, in Exp. II, which
involved more training days and larger differences in
reinforcement concentration, a compromise in response rate is
clearly obtained for CRF as well as for PRF. Apparently the more
extreme conditions of Exp. II are required, however, before a
compromise in rate will appear with CRF and instrumentally
conditioned licking. In this connection, Premack and Hillix (1962)
trained rats to lick 4% and 16% sucrose solutions from a
conventional drinking tube, a procedure analogous to CRF
conditioning as defined here, and then shifted the concentration
of sucrose for these rats to 32%. The licking rates on the new
concentration were consistently lower than the rate of rats
maintained on 32% sucrose throughout the experiment. Apparently,
experience with a particular reinforcing stimulus on CRF can make
that stimulus have long-lasting effects on behavior under some
conditions. The important point is that, for the present at least,
these conditions appear less well defined for CRF than they do for
PRF.
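
One way to make the idea of a "compromise" rate concrete is to express a switched group's asymptotic rate as a position between the two control rates. The sketch below is only an illustration of that arithmetic: the function name `compromise_index` and the licking rates are hypothetical and are not taken from the figures.

```python
def compromise_index(switched: float, low_control: float, high_control: float) -> float:
    """Locate a switched group's rate between the two control rates.

    Returns 0.0 when the switched group matches the Low-Control rate,
    1.0 when it matches the High-Control rate, and 0.5 when it lies
    exactly half way between them.
    """
    return (switched - low_control) / (high_control - low_control)


# Hypothetical asymptotic licking rates (arbitrary units, not the reported data).
index = compromise_index(switched=5.0, low_control=4.0, high_control=6.0)
print(index)
```

An index near 0.5 would correspond to the "about half way" asymptote described for the PRF Switched group of Exp. II.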

Second, if experience with a particular set of reinforcement
conditions on PRF produces residual effects which permanently
influence behavior, extinction should provide a sensitive test for
these effects. This would be true since extinction removes S from
direct exposure to the primary reinforcing stimulus. The data
suggest that the control of behavior by the reinforcing stimulus
is indeed permanent and that it does transfer to extinction.
First, the extinction data of Exp. I show that a past history of
shifts in the concentration of the reinforcer serves to markedly
increase resistance to extinction when PRF is used. Further, Hulse
(1958), Wagner (1961), and Hulse and Bacon (1962) have shown that
with PRF resistance to extinction increases as amount of
reinforcement increases. This is true whether amount of
reinforcement is defined in terms of weight of food or sweetness
of saccharin solutions.

None of these things are true when CRF is used. In Exp. I the CRF
groups that had a past history of shifts in concentration of the
reinforcer were, if anything, less resistant to extinction than
the control groups. In other experiments which have used CRF and
varied amount of reinforcement, the extinction results are at best
unpredictable. Sometimes resistance to extinction increases as
amount of reinforcement increases (Zeaman, 1949), sometimes it
decreases (Armus, 1959; Hulse, 1958), and sometimes it does not
vary with amount of reinforcement at all (Hulse & Bacon, 1962).
Since S receives no discrimination training for the reinforcer
with CRF, it is perhaps not surprising that these inconsistent
results should be obtained. The data suggest that with CRF, S is
stimulus bound. Response strength will be consistently correlated
with some stimulus property of the reinforcer as long as S is
directly exposed to it, but change the reinforcer—or remove
it—and the correlation vanishes.

Finally, none of the data in either experiment reveals the
slightest trace of elation, depression, positive contrast, or
negative contrast (Pubols, 1960). The only time that a switched
group licks either faster or slower than a control group, in a way
which might suggest a contrast effect, occurs after the initial
change in concentration of Exp. I. Here the rate of the CRF
High-Switched group drops below that of the Low-Control group, but
it immediately bounces back to a point which is higher than the
Low-Control group. As we have seen, it seems most meaningful in
the context of the present experiments to view changes in rate of
this sort as due to stimulus generalization decrement. In the same
connection, it is interesting to note that most of the significant
contrast effects that are reported in the literature are negative
contrast effects (Spence, 1956; Pubols, 1960). Since all the
earlier experiments have used CRF, perhaps generalization
decrement was at least as important as some emotional-motivational
factor in determining the contrast effects that were obtained.


SUMMARY


Instrumentally conditioned licking was studied in two experiments
as a function of ratio of reinforcement and shifts in the
concentration of a saccharin reinforcer. With partial
reinforcement (PRF), a shift in concentration produces an
immediate change in licking rate in the direction of the shift.
With continuous reinforcement (CRF), the immediate reaction to the
shift is always a decrease in response rate. With PRF, asymptotic
response rates reached several days after a shift are often
compromises between the rates of control groups maintained on high
and low concentrations. With CRF, a compromise of this sort
appears only when relatively large amounts of training and
relatively extreme differences in concentration of reinforcement
are used.

The data suggest that, with PRF, behavior is more critically and
permanently under the control of reinforcement stimuli than with
CRF. This happens because PRF provides discrimination training for
reinforcement stimuli, but CRF does not.


SUBGOAL CONSTANCY 1


HOWARD H. KENDLER


New York University


Problem solution sometimes re-
quires the integration of units of be-
havior that are already in the reper-
toire of S but have not previously been
used in conjunction with one another.
When the efficiency of integration
increases gradually as a function of
the number of trials the process is
variously described as trial-and-error,
instrumental, or selective learning.
When an efficient solution occurs on
the first trial, without any preceding
trial-and-error behavior, the process is
variously designated as reasoning, in-
sight, or inference. The present ex-
periment is one of a series that seeks to
explore the relationship between these
two processes in children (Kendler &
Kendler, 1956; Kendler, Kendler,
Pliskoff, & D’Amato, 1958; Kendler &
Kendler, 1961). These studies have
used an experimental paradigm, de-
rived from Hull (1935, 1952), in which
S is trained on three discrete behavior
segments (A-B, X-Y, and B-G) and
then presented with a test situation in
which S is instructed to get G when
only A and X are available. Problem
solution requires the assembly of A-B
and B-G.

The purposes of the present study
were (a) to determine whether the
ability to infer, as measured by these
particular operations, increases with
age and (b) to analyze the role of the
integrating B stimulus in the infer-
ential process.

1 This research was sponsored by the
National Science Foundation. The authors
[...] research.


METHOD


Experimental design.—There were four ex- 
perimental groups, 32 Ss in each, arranged in 
a 2 X 2 factorial design. The main effects 
were age, kindergartners (K) vs. third-graders 
(3rd), and the constancy of training and test 
subgoals, constant (C) vs. switched (S). 

Subjects.—The Ss were 137 children drawn 
from the Le Conte Public School in Berkeley, 
California. Nine Ss were eliminated, 5 of 
them because they made simultaneous A and 
X choices, and 4 due to inadvertent errors in 
experimental procedure. The data to be re- 
ported are based on the remaining 128 
children, of whom 63 were boys and 65 girls. 
The kindergartners’ mean age was 68.7 mo., 
range from 61 to 74. The third-graders’ 
mean age was 103.9 mo., range from 96 to 115. 
Within each age level Ss were randomly 
assigned to C and S groups without regard 
to age or sex. 

Apparatus.—The portable aluminum ap-
paratus used consisted of three distinct square
panels 17.5 cm. on a side. Each panel, which 
could be exposed to S’s view singly or in 
combination with the others, corresponded to 
one habit segment. The center panel pro- 
vided for the B-G segment. On its blue 
anodized surface was a circular opening 2.4 
cm. in diameter into which S could drop the 
objects that served as subgoals in this experi- 
ment, namely, a glass marble with a 1.8-cm. 
diameter and a steel ball bearing with a 1.3- 
cm. diameter. If S dropped the correct 
subgoal into the circular opening, a small, 
shiny, gold fairy-tale or nursery-rhyme charm 
was propelled to a trough near the bottom of 
the panel. The incorrect subgoal delivered 
no reward. The charms, e.g., “little Bo
Peep,” “the cow that jumped over the moon,”
“the gingerbread man,” etc., which were
presented in a set sequence, served as the 
major goal (G). 

The two side panels corresponded to the
A-B and X-Y segments. The left panel was
anodized pink and the right anodized yellow.
Each panel was equipped with a button which,
when pressed, closed a circuit that led to the
delivery of the appropriate subgoal (glass
marble or ball bearing) to a trough near the
bottom of that panel.

Procedure.—All Ss were run individually and completed in one
experimental session. The session began with the administration
of the Peabody Picture Vocabulary Test (Dunn, 1959), which took
from 5 to 15 min. and was followed immediately by training on the
inference apparatus. One of the side panels was opened and S was
told, “Press this button and see what happens.” When the subgoal
was delivered, S was instructed to pick it up, look at it, then
return it to E so that he might have another turn. This side
panel was closed, the other side panel opened, and the procedure
repeated. After S had one forced trial on each panel, the
procedure was repeated with the order of the sequences reversed.
Thus at this point in training S had made two responses on the
A-B and on the X-Y segments in an ABBA order. After these four
forced trials, the doors of both side panels were opened and S
was shown one of the subgoals and directed to, “Press the button
that will get one like this.” The procedure was repeated with
each of the subgoals presented in an ABBA-BAAB order until the
criterion of six successive correct responses was reached.
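
The presentation order and the run-length criterion used in this part of training can be sketched as follows. This is a minimal illustration under stated assumptions: the function names are hypothetical, the text does not say which subgoal was shown first, and the response record in the usage lines is invented for the example.

```python
from itertools import cycle, islice


def subgoal_order(first: str, second: str, n_trials: int) -> list:
    """Repeat an ABBA-BAAB block of the two subgoals for n_trials trials."""
    block = [first, second, second, first, second, first, first, second]
    return list(islice(cycle(block), n_trials))


def trials_to_criterion(correct: list, criterion: int = 6):
    """Return the trial on which `criterion` successive correct responses is reached."""
    run = 0
    for trial, was_correct in enumerate(correct, start=1):
        run = run + 1 if was_correct else 0
        if run == criterion:
            return trial
    return None  # criterion not reached in the record


# Which subgoal comes first is an assumption made for this example.
order = subgoal_order("marble", "ball bearing", 8)
# Invented response record: one error on Trial 2, then six correct in a row.
reached = trials_to_criterion([True, False, True, True, True, True, True, True])
print(order)
print(reached)
```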

The next step of the training started with E opening the middle
panel (after closing the two side ones) and directing the
attention of S to a small window through which he could see a
charm. He was told that it was a fairy-tale or nursery-rhyme
charm which he would soon get an opportunity to examine closely.
The aperture was pointed out and S was informed that if he
dropped “the right thing” into that hole, the charm would drop
into the tray. Then the marble and the ball bearing were placed
in front of the B-G panel and E said, “One of these two little
things is the right thing. Take one of them and drop it in the
hole to see if it makes the charm come out.” On the next trial S
was again instructed to drop in the one that would make the charm
come out. After that, training proceeded until a criterion of
four successive correct responses was reached. Ss who made two or
less errors received no additional instructions while those who
committed three or more errors were again urged to drop in the
one that would make the charm come out. The relative positions of
the marble and ball bearing (i.e., left and right) were varied in
a random order after each correct response throughout the
training on the B-G segment.

After preliminary training the test trial
was introduced with the following instructions: “Would you like
to see another charm? Very well, this time I won't put out any
little things, but I will open all the doors. If you do what you
are supposed to, you can make the charm come out. Go ahead.” The
S was allowed 60 sec. in which to make any response that he
chose. If he did not press the A or X button during this time, E
said, “Which button should you press to help you get the charm?
Go ahead.” After S pressed either button, he was allowed another
60 sec. to complete the sequence by dropping either subgoal into
the B-G aperture before the trial was terminated. Thus all Ss
made either an A or X response and the trial was terminated
either when S made a major goal response or when 60 sec. had
elapsed since the subgoal had appeared.

For half of the children at each age level 
the subgoals were switched (S) between the 
preliminary training and the test trial so that 
if S made an A choice, he obtained the Y sub- 
goal while if he made an X choice he obtained 
the B subgoal. For the remainder the sub- 
goals remained in a constant position (C) 
during training and testing. 
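
The difference between the two conditions on the test trial can be written out as a small lookup. This is an illustrative sketch only; the function names are hypothetical, while the mappings themselves follow the description above (B is the subgoal that was trained to yield the charm).

```python
def delivered_subgoal(choice: str, condition: str) -> str:
    """Subgoal delivered on the test trial for an A or X choice.

    condition "C": subgoals in the same positions as in training.
    condition "S": subgoals switched between training and test.
    """
    constant = {"A": "B", "X": "Y"}
    switched = {"A": "Y", "X": "B"}
    return constant[choice] if condition == "C" else switched[choice]


def yields_charm(subgoal: str) -> bool:
    """Only the B subgoal was trained to produce the major goal G."""
    return subgoal == "B"


for condition in ("C", "S"):
    for choice in ("A", "X"):
        subgoal = delivered_subgoal(choice, condition)
        print(condition, choice, subgoal, yields_charm(subgoal))
```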

In order to minimize contamination from 
position and subgoal preferences, they were 
assigned in counterbalanced order: (a) Half 
of the Ss in each experimental group had the 
right panel serve as the A-B segment while the 
left panel served as the X-Y segment. The 
opposite arrangement applied to the remain-
ing half. (b) For half of the Ss in each experi-
mental group the marble was B and the ball
bearing Y; for the remainder the ball bearing 
was B and the marble Y. 
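
The counterbalancing scheme can be enumerated to see how the 128 Ss would be distributed. The sketch assumes that the two counterbalancing factors (panel side and subgoal identity) were fully crossed within each experimental group; the text states only that each factor was split in half, so the crossing and the resulting cell size are assumptions of this example.

```python
from itertools import product

SS_PER_GROUP = 32  # four experimental groups of 32 Ss, as stated in the design

ages = ("K", "3rd")
subgoal_conditions = ("C", "S")
a_b_panel_side = ("right", "left")    # which side panel served as A-B
b_object = ("marble", "ball bearing")  # which object served as subgoal B

# Assumption: panel side and subgoal identity are crossed within each group.
cells = list(product(ages, subgoal_conditions, a_b_panel_side, b_object))
ss_per_cell = SS_PER_GROUP // (len(a_b_panel_side) * len(b_object))

print(len(cells), "cells of", ss_per_cell, "Ss each")
print("total Ss:", len(cells) * ss_per_cell)
```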

At the end of the experimental session 
each S was asked to keep the game a secret 
until everyone had a turn. It should be 
emphasized that at no point in the training 
or test did E describe the subgoal by name.


RESULTS AND DISCUSSION 


IQ scores.—The mean PPVT IQ
scores for the four experimental groups
were as follows: K-C (Kindergarten-
Constant), 105.1; K-S, 106.8; 3-C
(Third Grade-Constant), 110.5; and
3-S, 108.6. A 2 X 2 factorial analysis
of variance applied to these data
yielded no significant Fs. The mean
IQ for all Ss was 107.8.

Preliminary training.—All groups
quickly acquired the subgoal seg-
ments, i.e., they learned which panel
yielded the marble and which yielded 
the ball bearing. Eighty-four percent 
of all Ss attained criterion with no 
errors and 91% made no more than 
one error. No child made more than 
seven errors. The percentages of Ss 
who attained criterion with no errors 
were as follows, by groups: K-C, 84%; 
K-S, 94%; 3-C, 75%; and 3-S, 78%. 
A χ² analysis of the corresponding
frequencies revealed no statistically
significant differences among the
groups.
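
The χ² comparison can be illustrated with frequencies back-calculated from the rounded percentages. The sketch assumes 32 Ss in every group (as in the experimental design) and recovers counts by rounding, so the frequencies are approximations and not the authors' tabulated values; the helper `chi_square` is a plain-Python stand-in for whatever procedure was actually used.

```python
GROUP_SIZE = 32  # assumed for every group, per the experimental design
percent_no_errors = {"K-C": 84, "K-S": 94, "3-C": 75, "3-S": 78}

# Approximate counts of Ss reaching criterion with no errors.
passed = {g: round(GROUP_SIZE * p / 100) for g, p in percent_no_errors.items()}


def chi_square(observed_rows: list) -> float:
    """Pearson chi-square statistic for a table given as a list of rows."""
    col_totals = [sum(col) for col in zip(*observed_rows)]
    grand_total = sum(col_totals)
    statistic = 0.0
    for row in observed_rows:
        row_total = sum(row)
        for observed, col_total in zip(row, col_totals):
            expected = row_total * col_total / grand_total
            statistic += (observed - expected) ** 2 / expected
    return statistic


# One row per group: [no errors, one or more errors].
table = [[n, GROUP_SIZE - n] for n in passed.values()]
statistic = chi_square(table)
CRITICAL_05_DF3 = 7.815  # tabled chi-square value for df = 3, P = .05

print(passed)
print(round(statistic, 2), statistic < CRITICAL_05_DF3)
```

With these approximate counts the statistic falls below the tabled value for df = 3, which is in line with the reported absence of significant differences among the groups.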

The major goal segment was also 
easily learned, i.e., whether the marble 
or the ball bearing yielded the charm. 
Except for 1 S who made nine errors 
and another who made four, all chil- 
dren reached the criterion after no 
more than three errors. The per-
centages of Ss who attained criterion
with no more than one error were as
follows: K-C, 88%; K-S, 88%; 3-C,
69%; 3-S, 84%. A χ² analysis of
the corresponding frequencies again
yielded no significant differences
among the groups. It should be
noted, however, that in the acquisition 
of both the subgoal and major goal 
segments the older Ss made more 
errors than the younger. This may 
be another manifestation of the re- 
sponse-shift-tendency reported by 
Harlow (1959). 

Test trial: initial choice.—The first 
component of inferential behavior is 
the initial choice between A and X, in 
which A is the inferential choice. 
The combined results, which appear 
in Table 1, show a statistically
significant age difference in the ex- 
pected direction (P < .01). In fact, 
on this measure, the kindergarten 
children perform precisely at chance 
level. 

TABLE 1

PERCENTAGE OF SS IN EACH EXPERIMENTAL
GROUP WHOSE INITIAL CHOICE WAS A

Experimental Group     Chronological Age Level
Subgoals switched      50.0
Subgoals constant      50.0
Combined               50.0


This part of the results appears to be in conflict with two
previous studies (Kendler & Kendler, 1956; Kendler et al., 1958)
which showed that in groups of children between 34 and 60 mo. of
age there were significantly more A than X choices. The studies,
however, differed from the present one in two ways. The task used
was simpler and the Ss were from higher socioeconomic levels. A
more recent study (Kendler & Kendler, 1961) used a procedure and
socioeconomic sample comparable to the present ones and obtained
similar results. In that study, which had as Ss children between
30 and 65 mo. of age, the number of A and X choices was almost
exactly equal. It is therefore suggested that the apparent
conflict in results is due to the roles played by the difficulty
of the task and the intelligence of the Ss. To some extent this
explanation of the discrepancy between the studies is supported by
the findings of the present study (see Table 2) that the selection
of A is related to mental age. It is evident that there was, in
general, an increase in the percentage of A choices with
increasing MA. This is, however, an ad hoc analysis, not tested
for statistical significance; consequently the generality of the
results is limited. They do, nevertheless, support the implication
of the CA results, namely that inferential behavior is symptomatic
of an important developmental process.


Test trial: integration response.—If after his initial choice, S
inserted a subgoal into the G aperture, he was considered to have
made an integration response. If the subgoal thus utilized was B,
the integration response was correct. If it was Y, the integration
response was incorrect. Each of these integration responses could
be further subdivided into those that occurred with no unnecessary
responses intervening between initial and goal responses (direct)
and those that occurred after one or more unnecessary responses
(indirect). The intervening responses ranged from making only one
unnecessary response, e.g., pressing the X button, to repeating
almost the entire training sequence, e.g., pressing A but leaving
B in trough, then pressing X and leaving Y in trough, then taking
both subgoals out and setting them into the same position they
occupied during training of B-G, then finally taking up B to drop
into the G aperture. (The number of unnecessary responses in
indirect solutions may be a fruitful subject for analysis in the
future, but for the present the entire gamut was treated as an
entity.) Finally, S had the option of making no integration
response at all.


TABLE 2

PERCENTAGE OF A CHOICES AS A
FUNCTION OF MA

Chronological Age Level

5-6 Yr.    8-10 Yr.

\ o; \

36-71
72-95
96-119
120-143
144-167 +

40.0 |.098
48.3 |.093
66.7 |.192
100.0

100.0


Table 3 presents the results for all Ss in the C groups, divided
into these five categories. Since it might appear that the
superiority of the older children on this measure is merely a
reflection of the greater proportion of correct initial responses,
the results were also analyzed using as a base only Ss who made an
initial A choice, with the following effect. Twelve percent of the
kindergartners and 67% of the third-graders who made an initial A
choice made direct correct integration responses. This difference
is significant (P < .001).

There are several conclusions to be 
drawn from the data in Table 3. One 
is that, as was the case with the initial 
choice measure, the integration meas- 
ure yields marked differences in the 
performance of the two age levels. 
Note that the method used by those 
kindergartners who do attain a correct 
solution is primarily indirect. De- 
scriptively speaking, such inference 
was more like trial-and-error than 
insightful behavior. On the other 
hand, a very large majority of the 
older Ss made a correct integration 
response, and their solutions tended 
to be more direct than indirect. 

TABLE 3

PERCENTAGE OF Ss AT EACH AGE LEVEL IN CONSTANT SUBGOALS GROUPS
WHO MADE VARIOUS INTEGRATION RESPONSES

                          Kind of Integration
Age      A-B-G (+, Correct)     X-Y-G (Incorrect)     No Integration
Level    Direct | Indirect      Direct | Indirect     Response

5-6
8-10   | 50.0

The other conclusions deal with the relations between the first and
second components of the inferential sequence. Since many Ss,
particularly among the kindergartners, who made an A choice either made
no integration response at all or interposed other responses before
completing the sequence, it may be concluded that the stimulus presented
by the "B subgoal in its trough" does not necessarily produce the B-G
behavior segment.
On the other hand, it is also clear that 
when an integration response does 
occur it is much more likely to be a 
B-G response than a Y-G response. 
This result could be due to the pres- 
ence of the B stimulus since it was to 
this stimulus that the integration re- 
sponse was trained. However, in the 
Constant Subgoals groups the appear- 
ance of the B stimulus depends on an 
A response. The data have shown 
that, at least among the older Ss, the 
A choice is not a matter of chance. It 
must be due to some problem solving 
activity, probably some covert system 
of responses. It is therefore equally 
possible that the B-G integration may 
be attributed to the same response 
mechanism that led to the correct 
initial choice. 

In order to sort out whether the salient influence on the integration
response is related to the correctness of the initial choice or to the
correctness of the subgoal stimulus, the percentages of direct
integration responses under four different conditions are presented in
Table 4. Two of these conditions are drawn from the data of the Constant
Subgoals groups reported above. The other two conditions are drawn from
the Switched Subgoals groups, whose second component results have not yet
been reported. In this table R refers to the initial response and S to
the subgoal stimulus.
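
The R and S notation can be made concrete with a short sketch. It rests on an assumption that this excerpt implies but does not spell out: in the Constant Subgoals groups an A response exposes the B subgoal and an X response exposes the Y subgoal, while in the Switched Subgoals groups the pairing is reversed. The function and argument names are hypothetical.

```python
def condition(group, initial_choice):
    """Return the R/S condition label for one S.

    group: "constant" or "switched" (Subgoals group).
    initial_choice: "A" (correct) or "X" (incorrect).
    Assumption: B is the correct subgoal stimulus; Constant groups get the
    subgoal that matches the choice, Switched groups get the other one.
    """
    if group not in ("constant", "switched"):
        raise ValueError("group must be 'constant' or 'switched'")
    if initial_choice not in ("A", "X"):
        raise ValueError("initial_choice must be 'A' or 'X'")
    r_correct = initial_choice == "A"
    s_correct = r_correct if group == "constant" else not r_correct
    return "R{} S{}".format("+" if r_correct else "-", "+" if s_correct else "-")


# The four conditions compared in Table 4, under the assumption above.
for group in ("constant", "switched"):
    for choice in ("A", "X"):
        print(group, choice, condition(group, choice))
```

Under this reading, the two Constant conditions are R+ S+ and R- S-, and the two Switched conditions are R+ S- and R- S+.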

When the proportions of direct integration responses, correct (+) and
incorrect (-), under the various conditions are compared for the two age
levels, it is apparent that there are too few Ss in the kindergarten
groups to yield any trends. The older Ss' behavior, however, varies in an
interesting and statistically significant way. A χ² analysis of the
corresponding frequencies yields a P between .02 and .05. Third graders
are most likely to integrate the two components when both their initial
choice and the subgoal are correct. When one of these elements is
incorrect, the probability of an integration response decreases; however,
either element by itself does lead more than one-third of the older Ss to
integrate. When neither element is correct, the probability of
integration is minimal.

TABLE 4

PERCENTAGE OF Ss AT EACH AGE LEVEL WHO MADE DIRECT INTEGRATION RESPONSES
AS A FUNCTION OF THE CORRECTNESS OF THE INITIAL RESPONSE AND SUBGOAL

Condition Prior to         Subgoals       Type of
Integration Response                      Integration

                           Constant
                           Switched
                           Switched
                           Constant

In order to interpret these results it is necessary to examine more
closely the nature of these elements. It is to be expected that S+ is
more likely to produce integration than S-. But from Table 4 it can be
seen that S+ by itself is not sufficient to explain integration at either
age level. Among the younger Ss very little integration occurs even under
S+ conditions. Among the older Ss, S+ is important but is not as adequate
by itself as when in combination with R+. In fact R+ is so important that
it can effect some integration even when it is combined with S-.

To understand why S+ by itself is not the adequate stimulus for
integration, it must be realized that it does not stand for the B subgoal
in isolation. It actually symbolizes a total stimulus complex of "B in a
trough of one of the side panels." During training this particular
stimulus compound was associated with picking up B and then returning it
to E. There are quite a few Ss, particularly among the kindergartners,
who, during the test situation, do just that. It is another stimulus
compound, namely B and Y in front of the B-G panel, that is associated
with picking up B and dropping it into the aperture. The data indicate
that the generalization from the former stimulus compound to the latter
is more likely to occur in 8-10 yr. olds than in 5-6 yr. olds. In the
older age group it is more likely to occur after a correct initial choice
than after an incorrect one.


When these results are represented in terms of S-R associations (Kendler
& Kendler, 1962), it becomes possible to describe the psychological
difference in the behavior of the two age groups as well as to suggest
the relevant mechanisms for the inferential behavior reported in this
study. There appear to be three important characteristics of this
behavior. One is that making the correct initial choice in an inferential
solution depends on the ability of S to generate a covert anticipatory
response to the major goal and to respond appropriately to its cues. Such
ability, according to this study, is positively related to age (within
the limits tested). A second characteristic is that inferential solution
depends on a "short-cutting" process in which some overt S-R sequences
previously learned must drop out. The younger Ss had difficulty in this
respect. Repeating some of the responses practiced during training
prevented many from exhibiting direct inferential solutions. The third
characteristic of inferential behavior is that it is not governed
completely by external events. If it were, then it would be expected that
the availability of B following a response to X should yield as much
integration as when it follows a response to A. Since it does not, one
can conclude that the integration response must be influenced by some
internal stimulus component associated with the correct initial choice.
This source of stimulation is sufficiently potent to compete
successfully, in some cases, with external stimulation when there is a
conflict between the two. This three-part characterization of inference
is not offered as a theory. Its function is to analyze inferential
behavior into more fundamental processes so that each may be investigated
independently.


SUMMARY 


Children of two age levels, namely 5-6 and 8-10 yr., were presented with
a task that required the linkage of two out of three discretely acquired
segments of behavior. The solution consisted of making an initial choice
between two of the segments (one correct and the other incorrect) and
then integrating the product of that choice with the third segment. The
solutions could be direct (inferential), i.e., the goal achieved without
any unnecessary responses. They could be indirect, i.e., the goal
achieved after the repetition of previously acquired but presently
irrelevant behavior segments. The findings were: (a) Older Ss made
significantly more correct initial choices than younger Ss. (b) About
half of the younger Ss ultimately reached the goal, but their method was
primarily indirect. Almost all of the older Ss achieved solution and a
majority of them by direct inferential means. (c) In inferential
solutions the integration of the subgoal and major segments was a joint
function of the relevance of the external stimulation (as determined by
preliminary training) and the correctness of the initial choice. (d) When
these two contributing factors were experimentally balanced against each
other, it was found that the internal stimulation associated with the
correct initial response was about as important a determiner of
integration as the relevance of the external stimulus.
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Within the theoretical framework 
of Osgood's (Osgood, Suci, & Tannenbaum, 1957) theory of meaning the sign
of an object is a primary symbol assumed to evoke a representational
mediating response that is some part of the total behavior emitted by the
organism when stimulated by the object itself. This response produces
distinctive cues, mediating behavior that would otherwise not have
occurred in the absence of the previous association of the object with
the word. The representational elements (rm-sm) correspond to the meaning
of the sign, and because of the cue or stimulus components the sign can
be conditioned to other stimuli initially lacking in meaning. Upon
conditioning such symbolic stimuli are designated assigns. On the basis
of the congruity principle distinctive responses may be produced by novel
stimuli if the assign, within appropriate contextual arrangements, is
used to label other novel stimuli (Osgood et al., 1957).


Typical studies of this process are those in which the meaning of verbal
stimuli has been changed by simple conditioning (Staats, Staats, & Biggs,
1958) and shown to generalize to synonyms of those stimuli (Staats,
Staats, & Heard, 1959). Rhine and Silun (1958) demonstrated that
development and strength of a concept-attitude were affected by the
amount of reinforcement. Eisman (1955) provided reinforcements for a word
and then associated that word with various colored objects, thereby
strengthening the probability that the object would be chosen by the S.
Using a Treatments X Ss design, Osipow (1960) provided evidence that a
color-name associated with either a positive or negative evaluative word
resulted in corresponding changes in preference for a nonsense figure
when the color-name was associated with the figure. Since significant
changes also occurred in the control group his results were inconclusive.
Di Vesta (1962) compared the effects of reinforcing a neutral color-name
with the effects of attaching the neutral color-name to a positive
evaluative sign in the first stage of the mediation process. It was
demonstrated that both procedures were effective in changing preferences
for a nonsense figure when the color-name was subsequently associated
with the figure.

The primary concern in the present study was to compare the effect of two
different experimental designs commonly used in the studies summarized
above. These experiments were intended to extend the findings of the
previous investigations by testing two hypotheses: (a) that association
of a neutral symbol with several signs, all of which represent similar
polarities of the evaluative dimension of meaning, will result in
movement in semantic space of the assign corresponding to the connotative
meaning of the signs associated with it; and (b) that labeling a neutral
stimulus object (nonsense figure) with the conditioned assign will result
in an evaluation of that stimulus object corresponding to the acquired
meaning of the assign, independent of whatever effects may occur from
acquired distinctiveness of cues resulting from labeling alone.

Several distinctions between the designs used in the present study and
those used in previous investigations may be summarized: A Treatments X
Ss design with repeated measurements on each S was employed in Exp. I,
and a two-factor design with repeated measurements on each S was employed
in Exp. II. Both experiments provide for control of learning-how-to-learn
and warm-up effects. The effect of meaning was controlled by using
experimental treatments in which neutral evaluative words were
conditioned to the assign. In the Treatments X Ss experiment two
additional controls were used, one for the use of a label without
experimentally acquired meanings, and another for the evaluation of the
stimulus object without the use of labels. And, finally, in both
experiments the assign was associated with a nonsense figure as the
stimulus object rather than with another verbal symbol.


EXPERIMENT I

Method

Design.—The general procedure used in the experiment was a modification
of the mediated generalization paradigm. In Phase 1, the three sets of
nonsense syllables were, respectively, conditioned to words with
positive, neutral, and negative evaluative meanings. In Phase 2, the Ss
learned to name each of three different nonsense figures with one of the
three conditioned nonsense syllables, a fourth figure with an
unconditioned nonsense syllable, and Ss experienced a fifth figure
without naming it. Thus, one of the figures was associated with a
positive-conditioned nonsense syllable (positive treatment), a second
with a neutral-conditioned nonsense syllable (neutral treatment), a third
with a negative-conditioned nonsense syllable (negative treatment), a
fourth with an unconditioned nonsense syllable (control-UNS), and a fifth
with no name attached to it (control). In Phase 3, the Ss rated each of
the figures on three semantic differential scales.

Materials.—Three decks of 5 X 8 in. cards, with 24 cards in each deck,
were used in the verbal conditioning phase. One positive, neutral, or
negative evaluative word was printed in block letters 1½ in. high on each
card. Examples of the eight positive evaluative words are RIGHT, CLEAN,
BRAVE, and SMART; examples of the eight neutral evaluative words are
MIDDLE, USUAL, AVERAGE, and MEDIUM; and examples of the eight negative
words are WRONG, WICKED, FILTHY, and STUPID. The negative and positive
evaluative words were antonyms. Eight different words were used for each
of the polar positions. The order in which the cards were arranged in the
deck was determined at random with the restriction that each polar
position was represented twice in every block of six words and that no
more than one meaning occurred twice in succession in each block of six
words. The same words were used in the second and third decks and
differed from the first deck only in the order in which the words
appeared.
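
The restricted randomization of the decks can be illustrated with a short sketch. It is one possible reading, not the authors' procedure: each block of six cards holds two words of each polarity, and a block order is redrawn until at most one pair of adjacent cards shares a polarity. Only four words per polarity are named in the text, so the remaining four are filled with clearly labeled placeholders; all function names are hypothetical.

```python
import random

# Words named in the text, plus placeholders for the four per polarity
# that the text does not list.
NAMED = {
    "positive": ["RIGHT", "CLEAN", "BRAVE", "SMART"],
    "neutral": ["MIDDLE", "USUAL", "AVERAGE", "MEDIUM"],
    "negative": ["WRONG", "WICKED", "FILTHY", "STUPID"],
}
WORDS = {
    polarity: named + ["{}-placeholder-{}".format(polarity, i) for i in range(5, 9)]
    for polarity, named in NAMED.items()
}


def adjacent_repeats(polarities):
    """Count adjacent pairs of cards that share a polarity."""
    return sum(a == b for a, b in zip(polarities, polarities[1:]))


def build_deck(words_by_polarity, rng, block_size=6):
    """Return a list of (polarity, word) cards arranged in restricted random order."""
    pools = {p: rng.sample(ws, len(ws)) for p, ws in words_by_polarity.items()}
    per_block = block_size // len(pools)  # two words of each polarity per block
    n_blocks = min(len(ws) for ws in pools.values()) // per_block
    deck = []
    for b in range(n_blocks):
        block = [
            (p, w)
            for p, ws in pools.items()
            for w in ws[b * per_block:(b + 1) * per_block]
        ]
        # Redraw the block order until the succession restriction holds.
        while True:
            rng.shuffle(block)
            if adjacent_repeats([p for p, _ in block]) <= 1:
                break
        deck.extend(block)
    return deck


deck = build_deck(WORDS, random.Random(0))
```

Calling `build_deck` with three differently seeded generators would give three decks that contain the same 24 words in different orders, as the text describes for the second and third decks.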

Three sets of semantic differential rating scales were used. All scales
were measures of evaluative meaning. The first was a practice set, made
up of the Pretty-Ugly and the Wise-Foolish scales, to help Ss understand
the procedure. The second set, composed of the Cruel-Kind and
Wise-Foolish scales, was used to determine whether Ss' meanings for the
nonsense syllables were changed in Phase 1. The third set, made up of
Good-Bad, Pretty-Ugly, and Like-Dislike scales, was the measure used in
Phase 3 to determine the evaluation of the figures. It should be noted
that none of the scales duplicated any of the specific words used in the
verbal conditioning procedure. All scales were five-point scales, with
points 4 cm. apart. The S was permitted to check any place on the scale
that he felt best represented his judgment. The intensity of S's response
was obtained by measuring the distance, in centimeters, from the extreme
end of the scale to his check mark: 1 cm. represented extreme negative
ratings, 9 cm. marked the midpoint of the scale, and 17 cm. represented
extreme positive ratings.
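
The scoring rule above implies scale points at 1, 5, 9, 13, and 17 cm. The following sketch shows how a measured distance could be converted to a position on the five-point scale and to a signed deviation from the midpoint. The function names are hypothetical, and the conversions are offered as an illustration rather than as the scoring the authors actually tabulated.

```python
SCALE_MIN_CM = 1.0       # extreme negative rating
SCALE_MID_CM = 9.0       # midpoint of the scale
SCALE_MAX_CM = 17.0      # extreme positive rating
POINT_SPACING_CM = 4.0   # distance between adjacent scale points


def cm_to_point(cm):
    """Map a check-mark distance in cm to a position on the 1-5 scale."""
    if not SCALE_MIN_CM <= cm <= SCALE_MAX_CM:
        raise ValueError("check mark lies outside the scale")
    return 1.0 + (cm - SCALE_MIN_CM) / POINT_SPACING_CM


def deviation_from_midpoint(cm):
    """Signed distance from the neutral midpoint; positive means favorable."""
    if not SCALE_MIN_CM <= cm <= SCALE_MAX_CM:
        raise ValueError("check mark lies outside the scale")
    return cm - SCALE_MID_CM
```

Because Ss could check any place on the line, intermediate values such as 11 cm. are valid and map to fractional scale positions (here 3.5).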

Five nonsense figures were used, constructed according to the procedure
described by Attneave and Arnoult (1956) as Method 1 for angular shapes
with closed contours. Each figure was approximately 3½ X 4 in. and was
made of heavy white oaktag paper painted gray. These figures were mounted
on a white oaktag square with sides 5½ in. long.
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Procedure.—The Ss were first introduced 
to the task and to the method of using the semantic differential scales
by having them rate three sets of two pictures (similar to those in the
Stanford-Binet Intelligence Scales) on the Pretty-Ugly scale to determine
their ability to make these evaluations. In addition, S rated examples of
behavior such as "a boy crossing the street without looking," and
fictitious characters such as "Donald Duck" on the Wise-Foolish scales.
All Ss proceeded successfully through this task.

A procedure similar to that described earlier by Di Vesta (1961) was used
in Phase 1. The Ss were instructed that they were to learn the meanings
of three words (nonsense syllables) that they had not heard before. Nine
nonsense syllables were used throughout the experiment, 1 S receiving any
three at random from among these; for example, PIP, LOM, and SUP might
have been used for 1 S and CIT, POU, and LIM for another S. The polar
position of evaluative meaning (positive, neutral, or negative)
associated with a particular nonsense syllable was randomly arranged
within the total experiment in order to balance evaluations that might
have been based on associative characteristics of the syllables. (In Exp.
II the syllables were found to have no more than chance effects.) The S
was then given the first deck of cards and instructed to look at the top
card and indicate which of the nonsense syllables it defined. If S
thought, for example, that "plain" was a definition for LOM, he was to
say "LOM is plain." The correction procedure was used. After S responded,
E gave the correct response. If S was correct, he went on to the next
card; if incorrect, he repeated the correct association before proceeding
to the next card. This procedure was continued until S reached the
criterion of responding correctly to two blocks of six words each. This
criterion was used since it signified that S responded correctly to each
polarity at least four times with different words. If S did not reach
this criterion after proceeding through the three decks twice, he was
eliminated from the experiment. At the conclusion of this phase, S rated
each of the nonsense syllables (assigns) on the Cruel-Kind and the
Wise-Foolish rating scales.
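
The Phase 1 learning criterion can be expressed as a small check over a record of correct and incorrect responses. The sketch makes two assumptions that the text does not state outright: the two errorless blocks must be consecutive, and blocks are aligned with the six-card blocks of the deck. The limit of 144 trials corresponds to three decks of 24 cards gone through twice. The function name is hypothetical.

```python
def trials_to_criterion(correct, block_size=6, blocks_needed=2, max_trials=144):
    """Return the trial count at which the criterion is met, or None.

    correct: sequence of booleans, one per card, in presentation order.
    Assumptions: errorless blocks must be consecutive and aligned to the
    deck's six-card blocks; None means S would have been eliminated
    (or has not yet reached the criterion).
    """
    run = 0
    limit = min(len(correct), max_trials)
    for start in range(0, limit - block_size + 1, block_size):
        if all(correct[start:start + block_size]):
            run += 1
        else:
            run = 0
        if run == blocks_needed:
            return start + block_size
    return None
```

For an S who errs somewhere in the first block and then answers the next twelve cards correctly, the function returns 18.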

In Phase 2 S was told that the syllables he had just learned about were
also the names of certain figures. The S was shown five nonsense figures
and told that since they were unlike any other figures that he may have
seen, they could not be called by names like rectangles, triangles, and
the like. The S was further instructed that four of the figures did have
names which he was to learn, and that the names corresponded in three
cases to the syllables he had just learned, while the fourth figure also
had a name (the unconditioned nonsense syllable) but that he had not
learned it, and that the fifth figure had no name. The S was then shown
the figures, one at a time, in random order until he could name all
labeled ones correctly in four successive presentations without error. A
procedure similar to that used in the first stage was used in reinforcing
correct responses. In order to randomize any effects of initial
preference for the figure, pairing of assigns and figures was varied
among Ss.

In the third phase, S was asked to rate the figures, one at a time. The
order in which figures were presented to a particular S was determined at
random. Each figure was rated on three semantic differential scales,
Good-Bad, Pretty-Ugly, and Like-Dislike.

In order to increase the level of motivation S was told that he would
earn a toy if he participated in the experiment. At the conclusion of the
session he was allowed to select his choice of trinkets from among
crayons, baseball picture cards, balloons, and the like.

Subjects.—The Ss were 24 children from one fifth-grade class in an
elementary school. However, 4 Ss were eliminated from the analysis
because they failed to meet the criterion for learning in Phase 1 and 3
other Ss were eliminated because they failed to use the rating scales
appropriately. Thus, all analyses were based on an N of 17. The Ss' CAs
were 9 and 10 yr.


Results

An average of 42.70 (SD 21.79) pairings, excluding criterial trials, of
nonsense syllables with signs was required to learn the evaluative
meanings of the assigns and an average of 28.25 (SD 17.63) pairings was
required to achieve the criterion in learning to associate the assign
with the nonsense figures. The means and SDs for ratings of the nonsense
syllables immediately after conditioning, on the Cruel-Kind and
Wise-Foolish scales, were compared.² Since no more than two scores
overlapped in any one of the comparisons of the distributions of ratings
for the three treatments, it was obvious that the differences were
significant and no formal tests of significance were made.

TABLE 1

ANALYSIS OF VARIANCE OF FINAL RATINGS OF FIGURES: EXP. I

Source                    df         MS         F

Between Ss                16       24.18
Within Ss
  Treatments (T)           4      195.62      5.06*
  T X Ss (Error 1)        64       38.70
  Scales (S)               2       63.83      7.24*
  T X S                    8        5.92     <1.00
  S X Ss (Error 2)       160        8.82

Total                    254

* P < .01

Each S had been administered all treatments and used three scales in the
final ratings of the figures within each treatment. Accordingly, a split
split-plots analysis of variance was used.³ A summary of the Treatments X
Scales X Ss analysis is presented in Table 1. As is evident from the
table, the Fs for the main effects of both Treatments and Scales were
significant (P < .01). A test of the significance of the differences
between the means of the scales indicated that the ratings made on the
Pretty-Ugly scale tended to be generally more negative (P < .01) than the
ratings made on either of the other two scales. (Similar results were
found in Exp. II.) The ratings made on the Good-Bad and the Like-Dislike
scales were not significantly different (P > .05). The difference in the
Pretty-Ugly scale did not interact with treatments as indicated by the F
(< 1.00) for Treatments X Scales interaction. Thus only treatment means
obtained by combining the ratings of the three scales to obtain a total
score were compared. The overall comparison of treatments using combined
scores is represented by the F of 5.06 (P < .01).

² Two tables in which are presented the means and SDs of assign ratings
for each group in Exp. I and II, one table summarizing the learning data
for Exp. II, and one table summarizing the analysis of variance testing
the effects of nonsense syllables within each group in Exp. II, have been
deposited with the American Documentation Institute. Order Document No.
7261 from ADI Auxiliary Publications Project, Photoduplication Service,
Library of Congress; Washington 25, D. C., remitting in advance $1.25 for
microfilm or $1.25 for photocopies. Make checks payable to: Chief,
Photoduplication Service, Library of Congress.

³ The assumption of homogeneity of variance was found to be tenable, via
Bartlett's test, for all analyses of variance except where otherwise
indicated.
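
As a check on how the F ratios in Table 1 are formed, the sketch below divides each effect's mean square by the mean square of its error term, following the split-plot layout of the table (Treatments against Error 1; Scales and T X S against Error 2). The pairing of effects with error terms is read from the table's arrangement and is an assumption here. Because the published mean squares are rounded, the recomputed Treatments ratio comes out near 5.05 rather than exactly the printed 5.06.

```python
# (df, mean square) for each source, as printed in Table 1.
TABLE_1 = {
    "Between Ss": (16, 24.18),
    "Treatments (T)": (4, 195.62),
    "T X Ss (Error 1)": (64, 38.70),
    "Scales (S)": (2, 63.83),
    "T X S": (8, 5.92),
    "S X Ss (Error 2)": (160, 8.82),
}

# Assumed pairing of each effect with its error term.
ERROR_TERM = {
    "Treatments (T)": "T X Ss (Error 1)",
    "Scales (S)": "S X Ss (Error 2)",
    "T X S": "S X Ss (Error 2)",
}


def f_ratio(effect):
    """F = MS(effect) / MS(error term for that effect)."""
    _, ms_effect = TABLE_1[effect]
    _, ms_error = TABLE_1[ERROR_TERM[effect]]
    return ms_effect / ms_error


for effect in ERROR_TERM:
    print("{:<16s} F = {:.2f}".format(effect, f_ratio(effect)))

# The component degrees of freedom add up to the printed total.
total_df = sum(df for df, _ in TABLE_1.values())
```

The degrees of freedom also agree with the design: 17 Ss, five treatments, and three scales give 255 ratings, hence a total of 254.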

The means and SDs of individual scale scores and total scores for
Treatments are presented in Table 2. Duncan's multiple range test
(Edwards, 1960) was used in testing the multiple comparisons between
means. The Treatments X Ss error term was used in making the comparisons.
The comparisons of the positive evaluative treatment with each of the
four other treatments indicated that all differences were significant
(P < .001). The comparisons of the negative evaluative treatment with the
neutral evaluative treatment and the control without labeling were also
significant (P < .01). The comparison of the negative evaluative
treatment with the control in which only labeling was used was not
significant (P > .05). None of the other comparisons among controls and
the neutral treatment were significant (P > .10).

TABLE 2

MEANS AND SDs OF RATINGS OF FIGURES ON EACH SCALE AND TOTAL SCALES
FOR EACH TREATMENT: EXP. I

                                     Scales
                  Good-Bad         Pretty-Ugly      Like-Dislike
Treatment         Mean     SD                       Mean

Positive          12.71    3.10                     12.59
Neutral            9.76    4.24                      9.29
Negative           7.00    5.01                      7.06
Control-UNS        8.82    2.29                      7.41
Control            8.94    3.84                     10.06


EXPERIMENT II

Method

Design.—The primary purpose of this experiment was to replicate a part of
Exp. I while excluding carry-over effects from one treatment to the next,
a consideration often neglected by previous investigators. Nine groups,
in a two-factor design with three repeated measurements in each group,
were used. One factor was based on the three polarities (positive,
neutral, and negative) of evaluative meaning; the second was based on the
use of three different figures. Three figures were used to increase the
generality of the results. If no differences were found between figures,
the results then could be attributed more definitely to the mediation of
meaning. In Phase 1 the nonsense syllable was associated with signs
having positive evaluative meanings for the positive group; with
neutral-evaluative meanings for the neutral group; and with
negative-evaluative meanings for the negative group. In Phase 2 the
conditioned nonsense syllable was associated with Fig. 1 by one-third of
each group, Fig. 2 by another third, and Fig. 3 by the last third of each
group. In Phase 3 Ss used the three scales, as in the previous
experiment, for rating the figure labeled by the assign.

Procedure.—The following modifications were made in the procedure used in
Exp. I. In Phase 1, S learned the meaning for only one nonsense syllable.
The same list of words was used as in Exp. I and in exactly the same
form. However, in Exp. II S indicated whether or not each of the 24 words
was the meaning for the nonsense syllable to be conditioned. Thus, for
example, if he was in the neutral group he would say, "LOM is plain" or
"LOM is not brave." The E then recited the correct association. If S was
correct he turned to the next card; if incorrect he repeated the correct
combination and then turned to the next card. Eight of the 24 words,
depending upon the experimental condition, were used as the definition of
the nonsense syllable in each group. In Phase 2 S then identified and
named only three figures; one of the figures was labeled by the
conditioned nonsense syllable and the other two figures were named with
unconditioned nonsense syllables. The three figures were selected on the
basis of pretest data with over 150 children in which these figures had
been ranked as the most neutral of six figures.

Three different nonsense syllables were used in each group. Different
syllables were used between groups. The fact that the three syllables
might have comprised a fourth factor in the total design was
inadvertently neglected. However, since equal numbers of Ss were assigned
each of the syllables within any one condition, the data were analyzed by
separate analyses of variance to determine whether the syllables used
within each treatment had other than chance effects on the results. In
the three analyses thus made, figures and syllables were fixed factors
and scales were repeated measures. In every case, the main effect of
syllables and the interaction term in which syllables appeared were not
significant (P > .20). All but 2 of the 12 Fs calculated were < 1.00.²
Accordingly, the conclusion was that the syllables selected had no effect
on final ratings made.

In Phase 3, S rated the figure labeled with the conditioned syllable. The
ratings were made with the same three scales used in Exp. I. The figures
labeled with the unconditioned nonsense syllables were not rated since
these data were not relevant to the hypotheses.

Subjects.—The Ss were 81 children from the fifth grade classes of a
different elementary school from that used in Exp. I. The Ss were 9 and
10 yr. of age, and were randomly assigned to the nine experimental
groups. Twelve Ss had been eliminated; 6 Ss failed to learn the task in
Phase 1, and 1 S in the positive group, 4 Ss in the neutral group, and 1
in the negative group failed to use the rating scales properly at the end
of the experiment.


Results

The means and SDs of the learning data as measured by errors and trials
to reach the criterion for the three main experimental groups in Phases 1
and 2 were compared.² The variances of the learning data in Phase 1 were
heterogeneous, requiring the use of the Kruskal-Wallis analysis of
variance to test differences among the groups. Significant (P < .01)
differences among groups were found for both measures. Multiple
comparisons were tested by the Mann-Whitney U test. The difference in
errors between the positive and negative group was not significant
(P > .05) while the comparisons of each of those groups with the neutral
group were significant (P < .01). All comparisons on the "trials"
criterion were significant (P < .01). However, the order of difficulty in
learning was not correlated with the order of the final preference
rating; therefore, the assumption was made that trials to learn or errors
made in learning were not related to changes in preferences, and that any
differences in final preference could be attributed to the mediation of
meaning. No significant differences were found in the comparisons based
on either of the learning measures in Phase 2.
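
For readers unfamiliar with the rank test named above, the following sketch computes the Kruskal-Wallis H statistic from first principles. It is a generic textbook formulation, not a reproduction of the authors' computation: it omits the correction for ties, and the example scores are invented purely to exercise the function, since the learning data themselves were deposited separately and are not reproduced here.

```python
from itertools import chain


def average_ranks(values):
    """Rank values from 1 upward, giving tied values their mean rank."""
    order = sorted(range(len(values)), key=values.__getitem__)
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def kruskal_wallis_h(groups):
    """H = 12 / (N (N + 1)) * sum(R_j^2 / n_j) - 3 (N + 1), no tie correction."""
    pooled = list(chain.from_iterable(groups))
    n_total = len(pooled)
    ranks = average_ranks(pooled)
    weighted = 0.0
    position = 0
    for group in groups:
        rank_sum = sum(ranks[position:position + len(group)])
        weighted += rank_sum ** 2 / len(group)
        position += len(group)
    return 12.0 / (n_total * (n_total + 1)) * weighted - 3 * (n_total + 1)


# Invented scores for three groups, used only to illustrate the call.
h = kruskal_wallis_h([[1, 2, 3], [4, 5, 6], [7, 8, 9]])
```

With large enough groups, H is referred to the chi-square distribution with one fewer degree of freedom than the number of groups.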

The means and SDs of the assign ratings following the verbal conditioning
procedure in Phase 1 were summarized.² These data compared favorably with
those obtained for Exp. I.

The Treatments X Figures X Scales 
analysis of variance of the ratings 
made of the figures in Phase 3° is 
summarized in Table 3. While the 
overall Treatment effects were sig- 
nificant (P < .001) the differences in 
Scales (P < .001) and the interaction 
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rABLE 3 


ANALYSIS OF VARIANCE OF FINAL RATINGS 
OF Ficures: Exp. Il 


Source 
Between Ss 
Treatment (T) 
Figures (F) 
TE 
Error (1) 


10.05** 
<1.00 


Within Ss 
Scales (S) 
‘xo 
F xs 
ce et ee. 

Error (2) 

Total 


*P <.01 
*P < 001 


of Treatments and Scales (P < .01) 
were also significant. The means and 
SDs of the ratings of the figures on 
each of the three scales are presented 
in Table 4. The overall ratings are 
also summarized to permit comparison 
with the data from Exp. I. Data for 
the specific figures have been com- 
bined since the F for differences 
between figures was < 1.00. In the
multiple comparisons⁴ all scales significantly
(P < .01) differentiated the
positive from the negative group; the 
Good-Bad and the Like-Dislike scales 
significantly (P < .05) differentiated 
the neutral from the negative group; 
and the Pretty-Ugly scale significantly 
(P < .05) differentiated the neutral 
from the positive group. None of the 
other comparisons was significant. 
When the data from the three scales 
were combined the differentiation 
among the three groups was more 
clear-cut and the trends clearly cor- 
responded to the conditioning of the 
syllables as well as to the data for 


⁴ The modification of Duncan's multiple
comparison test described by Collier (1958) 
was used in testing the differences between 
treatment means on each scale. 
TABLE 4
FINAL RATINGS OF FIGURES ON EACH SCALE AND TOTAL SCALES
BY EXPERIMENTAL GROUPS: EXP. II

Group        Mean
Positive     10.67
Neutral       9.30
Negative      4.52

Exp. I. The difference between the
neutral and positive groups was significant
(P < .05). The differences
in the remaining two comparisons
were also significant (P < .05).


DISCUSSION 


The results of the two experiments demonstrate that (a) assigns
acquire evaluative meaning through association with several signs
having specific polarity on this dimension of meaning and that
(b) these assigns transfer to affect the ratings of figures
associated with them. The changes in preferences for the figures
corresponded to the meanings, rather than the specific words, with
which the nonsense syllable had been associated. Where such
associations had not been made, as in the two control groups of
Exp. I, the result was a neutral evaluation of the figure. These
findings imply the occurrence of the mediation chain consisting of
figure, nonsense syllable, associations, and connotative meaning.
However, it is evident that the procedures used impose a severe
limitation on this interpretation, making an alternative
explanation based on simple association principles an equally
likely assumption. Thus, once the assign had been formed in the
first learning phase it became a "synonym" for the associated
words. These words, in turn, may have been directly or implicitly
associated with the figure during the second learning phase when
the figure was associated with the nonsense syllable. A more
convincing demonstration of the mediation process would necessitate
a comparison of the present experiments with those in which the two
learning phases are reversed, that is, a comparison of the present
experiments with one in which the standard mediation paradigm is
employed. These studies are currently being planned and are to be
conducted in exactly the same settings in which the present studies
were made.

TABLE 4 (continued). Like-Dislike scale, Mean: 11.00, 9.89, 6.93

Since the main concern, however, was 
in a comparison of the two designs, their 
respective merits and limitations warrant 
attention. Basically, they yielded the 
same results and one is, in a real sense, 
a replication of the other. The intraindividual design appears to
be more sensitive in the gross demonstration of the effect than is
the individual treatments design. Among the assumptions of the
intraindividual design are that there are no order effects and that
there are no carry-over effects from one treatment to another. The
first is tenable on the premise that treatments were assigned in
random order to Ss. A violation of the second, however, is a very
likely occurrence in the major phases of Exp. I where Ss learned
the three meanings concomitantly. The comparisons between the
stimuli conceivably facilitate the distinctions in evaluative
response. Although the figures were presented in random order, once
one figure had been rated a comparison with the others was
inevitable; and, since the same three scales were employed in each
rating it would also be expected that carry-over effects would
occur in the use of the scale.
In both experiments, there were significant
differences among the results
obtained from the different scales. In 
all but one instance, that is, in the use of 
the Pretty-Ugly scale in Exp. II, the 
preferences for all figures based on each 
of the scales were in the predicted direc- 
tion. The total score provided more 
adequate discrimination between groups 
in both experiments. It appears that the 
primary factor in the slight differences 
among scales may be accounted for in 
terms of lowered reliability when only 
one scale is used. The fact that other 
investigators, e.g., Staats, Staats, & 
Biggs (1958), have used a single scale 
with reliable results may merely reflect 
the ability of adults to use the scales with 
greater accuracy than children. 

Within the limitations discussed above, 
the present study tentatively suggests
the applicability of the representational 
mediation process to the development of 
attitudes (Dodge, 1955; Osgood et al., 
1957). In terms of the operations of 
measurement with the semantic differen- 
tial the meaning of a stimulus object is 


its allocation in multidimensional space, 
with attitude defined as the projection of 
this point onto the evaluative dimension 
of that space (Osgood et al., 1957). The 
principal concern is with the common 
elements of the mediating response of the 


meanings of the signs. When the assign 
is associated with evaluative signs, the 
meaning of the assign will depend upon 
the mediating response evoked by these 
signs. The assign may be attached 
to other neutral stimulus objects to 
influence the responses evoked by these 
stimuli. Attitudes are thus characterized 
as implicitly learned processes with po- 
tentially bipolar evaluative properties. 


SUMMARY 


This study compared two experimental 
designs using procedures for the study of the 
mediation process. In both experiments the 
first phase consisted of assign development by 
conditioning signs with evaluative meaning 
to neutral nonsense syllables. Phase 2 in- 
volved associating the assign with a neutral 
nonsense figure. In the third phase Ss rated 
the nonsense figure on three semantic differential
evaluative scales.

There were 17 fifth-grade children in Exp.
I. Each S received five treatments. The
dependent variable was the rating of five 
different nonsense figures. Three of the 
figures were labeled by assigns previously 
conditioned in Phase 1 to signs having nega- 
tive, neutral, or positive evaluative meanings, 
respectively. A fourth figure was labeled by 
an unconditioned nonsense syllable and the 
fifth figure was experienced equally often with 
the others but was not labeled. Ratings, 
made on each of the scales, of the figures 
labeled with conditioned assigns corresponded 
with the evaluative meaning of signs associ- 
ated with the assigns. No significant differ- 
ences were found between ratings made of the 
figure labeled by the neutral assigns, that 
labeled by the unconditioned nonsense syl- 
lable, or the figure not labeled. 

In Exp. II there were 81 fifth-grade pupils. 
In a two-factor design with repeated measures, 
independent groups were required to associate 
signs, varying, respectively, on the negative, 
neutral, and positive polarities of evaluative 
meaning, to assigns. Within each group the 
conditioned assigns were used to label one of 
three different nonsense figures. The remaining two figures were
labeled by nonsense syllables. The results of this experiment
corroborated those of the first. The overall ratings, using the
combined scores of the three scales, differentiated significantly
between the three groups. There was, however, a significant
interaction between treatments and scales, although in all but one
comparison the differences were in the predicted direction. No
significant differences were found between figures. Nor were there
significant differences between the nonsense syllables used with
any one of the treatment groups.
Several previous card-sorting and
block-sorting experiments have com- 
pared the speed of making reversal 
and nonreversal shifts (Buss, 1953, 
1956; Harrow & Friedman, 1958; 
Kendler & D’Amato, 1955; Kendler 
& Mayzner, 1956; Kendler & Kendler, 
1959). Typically, a reversal shift has involved learning two
successive sorting tasks, with correct responses for Task 2 being
based on the same dimension of the stimuli as on Task 1, but with
S being required, literally, to reverse his previous sorting
responses. For example, in Task 1, red cards must be placed in
Sorting Category A and green cards in Category B, and in Task 2 red
cards belong in Category B and green cards in Category A.
Nonreversal shifts have involved learning one sorting task and then
learning a second sorting task based on some dimension of the
stimuli which was irrelevant in Task 1. Previous sorting
experiments with college students have found that reversal shifts
are learned quicker than nonreversal shifts.

1 Now at Yale University School of Medicine.

In a number of the experiments in this area (Buss, 1956; Harrow &
Friedman, 1958; Kendler & D'Amato, 1955; Kendler & Kendler, 1959)
it has been hypothesized that reversal shifts are learned more
quickly because Ss respond to the same dimension of the stimuli as
was used previously in Task 1 learning, whereas in a nonreversal
shift they are required to respond to a new dimension of the
stimuli. According to this analysis,
reversal shifts are learned quickly 
because the same cues that were used 
during Task 1 learning are again 
relevant during the learning of Task 2. 
The S must merely learn to make 
different responses to the previously
used cues. On the basis of this hypothesis
the results in the above experiments
have been interpreted as
supporting a mediational S-R frame- 
work. However, reversal groups in 
the above experiments, besides using 
the same dimension of the stimuli that 
was previously relevant, have also 
been required to make literal reversals, 
in which the exact opposite sorting 
response was required. 

If the analysis in terms of the ad- 
vantages of using the same dimension 
of the stimuli is correct, then this 
condition alone should be sufficient to 
produce facilitating effects. Accord- 
ing to the above analysis supporting 
the mediational S-R framework, it 
would be expected that a shift which 
allows S to respond to previously
relevant cues (for purposes of sim- 
plification and to maintain the tradi- 
tional terminology used previously, 
this type of shift will also be called 
a reversal shift) would be learned 
quickly even if the new responses that 
are required are not the exact opposite 
of the previously learned responses. 
Thus, after learning a concept based 
on the number of stimulus elements, a 
reversal group switching to a second
number concept should learn more
quickly than a nonreversal group
which shifts to a concept based on the
position of the stimuli, regardless of
whether or not exactly opposite sorting
responses are required. The
present experiment attempted to test 
this analysis. 

An interesting factor which has been 
involved, in an incidental manner, in 
some of the previous experiments con- 
cerning reversal and nonreversal shifts 
(Kendler & D’Amato, 1955; Kendler 
& Mayzner, 1956) was also investi- 
gated. This variable is the use of 
consistent and inconsistent responses. 
These responses occur in experiments 
which require Ss to sort response 
cards into compartments labeled with 
stimulus cards. Consistent responses 
occur when response cards are placed 
with stimulus cards which are similar 
to them in some respect (e.g., when S
is required to sort response cards
according to their color and must
place red response cards into a compartment
labeled with a red stimulus
card). Inconsistent responses occur
when there is little or no similarity
between the stimulus card and its
appropriate response cards (e.g., when
S is required to sort response cards
according to their color, but must put
red response cards with a nonred
stimulus card).


While consistent and inconsistent 
responses have appeared in some of 
the experiments comparing reversal 
and nonreversal shifts the influence of 
this variable on the ease of making 
these shifts has not been directly
investigated. It is possible that the
relative difficulty of reversal shifts as
opposed to nonreversal shifts may be
affected by whether the responses
required are consistent or inconsistent.
The present experiment was designed
to investigate the influence of this
variable. This was done in a card-sorting
situation which varied consistent
and inconsistent responses in a
systematic fashion while comparing
reversal and nonreversal shifts.


METHOD


Subjects.—The Ss were 83 undergraduate 
students enrolled in introductory psychology 
courses at Indiana University. Nineteen of 
these Ss were eliminated for failure to learn 
the first concept. 

Materials.—The experiment was carried
out in a four-category card-sorting situation
in which response cards were sorted into
categories labeled with stimulus cards. A
deck of 16 response cards, 3 × 3 in. in size
and similar in principle to the Wisconsin
Card Sorting Test (Berg, 1948), was devised.
Small green stars were placed on these cards 
and the cards varied according to the number 
and position of the stars on them. There 
were one, two, three, or four stars placed on 
each response card. The stars were arranged 
systematically with all possible combinations 
of number of stars appearing in each of the 
four corners of the cards. This made a set of 
16 response cards, each of which could be 
sorted according to the number of stars on it 
or according to the position of the stars on it 
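
The deck described above can be sketched as the full crossing of star count and corner. This is only an illustration: the corner codes `UL`, `LL`, `LR`, `UR` are borrowed from the stimulus-card names, and the function name `build_deck` is hypothetical.

```python
from itertools import product

# Assumed corner codes, taken from the stimulus-card labels (UL1, LL2, LR3, UR4).
CORNERS = ("UL", "LL", "LR", "UR")
STAR_COUNTS = (1, 2, 3, 4)


def build_deck():
    """Return the response cards as (number_of_stars, corner) pairs.

    Every star count appears once in every corner, so each card can be
    sorted either by number or by position.
    """
    return [(n_stars, corner) for n_stars, corner in product(STAR_COUNTS, CORNERS)]


deck = build_deck()
```

Crossing four star counts with four corners gives the 16 response cards mentioned in the text.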

There were 4 stimulus cards and the Ss 
were required to sort the 16 response cards 
with them. These stimulus cards were attached
to a black card holder. The first
stimulus card had one black line in the
upper-left-hand corner (Card UL1), the second card
had two black lines in the lower-left-hand
corner (Card LL2), the third had three black
lines in the lower-right-hand corner (Card
LR3), and the fourth had four black lines in
the upper-right-hand corner (Card UR4).
At the top of the card holder, to which the
stimulus cards were attached, was a pocket
into which could be placed a response card.

Three different "position" concepts were
used (where the correct sorting of each 
response card depends on the position that the 
stars are placed on the card) and three differ- 
ent “number” concepts were used (where the 
correct sorting of each response card depends 
on the number of stars on the card). The 
three position concepts were a consistent 
position concept and two inconsistent position 
concepts (these latter two were a clockwise 
position concept and a counterclockwise posi- 
tion concept). The consistent position 
concept required that response cards which 
had stars located in Corner X be placed with 
the stimulus card which had its lines located 
in the same corner, Corner X. For example, 
response cards having stars in the upper-left-hand
corner were placed with Stimulus Card
UL1, which also had its lines located in the 
upper-left-hand corner. Likewise response 
cards having stars in the lower-left-hand
corner belonged with Stimulus Card LL2, 
response cards having stars in the lower-right- 
hand corner went with Stimulus Card LR3, 
and response cards having stars in the upper- 
right-hand corner belonged with Stimulus 
Card UR4. One of the inconsistent position 
concepts (the clockwise concept) required 
that response cards which had stars in 
Corner X be sorted with the stimulus card 
which had its lines located in the corner 
which is one position clockwise to Corner X. 
Thus, response cards having their stars
located in the lower-left-hand corner had to be
sorted with Stimulus Card UL1, which had its
lines located in the upper-left-hand corner
(one position clockwise). Likewise, response
cards having stars in the lower-right-hand
corner belonged with Stimulus Card LL2,
response cards having stars in the upper-right-hand
corner went with Stimulus Card
LR3, and cards having stars in the upper-left-hand
corner belonged with Stimulus Card
UR4. The other inconsistent position concept
(the counterclockwise concept) required
that response cards which had stars in Corner
X be sorted with the stimulus card which had
its lines located in the corner which was one
position counterclockwise to Corner X.

The three "number" concepts were a
consistent number concept and two inconsistent
number concepts: an (n + 1)
concept and an (n − 1) concept. The
consistent number concept required that response
cards with n stars on them be sorted
with the stimulus card which also had n lines
on it. Thus, response cards which had one
star had to be sorted with the stimulus card
with one line (Card UL1). Likewise, response
cards having two stars belonged with Stimulus
Card LL2, response cards having three stars
went with Stimulus Card LR3, and response
cards having four stars went with Card UR4.
One of the inconsistent number concepts, the
(n + 1) concept, required that response
cards which had n stars be sorted with the
stimulus card which had (n + 1) lines.
Hence, response cards which had one star
had to be sorted with the stimulus card which
had two lines (Card LL2). Similarly, response
cards having two stars belonged with
Stimulus Card LR3, response cards having
three stars went with Stimulus Card UR4,
and response cards having four stars belonged
with Stimulus Card UL1. The other inconsistent
number concept, the (n − 1) concept,
required that Ss sort response cards
which had n stars with the stimulus card
which had (n − 1) lines on it.
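
The six sorting rules described above can be written as shifts on two small cycles. The sketch below is illustrative only: the function and table names are hypothetical, and the wrap-around for the (n − 1) concept (one star going with the four-line card) is an assumption made by symmetry with the (n + 1) example in the text, which sends four stars to Card UL1.

```python
# Corners listed in clockwise order, starting from the upper left.
CLOCKWISE = ("UL", "UR", "LR", "LL")
CARD_BY_CORNER = {"UL": "UL1", "LL": "LL2", "LR": "LR3", "UR": "UR4"}
CARD_BY_LINES = {1: "UL1", 2: "LL2", 3: "LR3", 4: "UR4"}


def card_for_position(corner, shift=0):
    """Stimulus card whose lines sit `shift` positions clockwise of `corner`."""
    index = CLOCKWISE.index(corner)
    return CARD_BY_CORNER[CLOCKWISE[(index + shift) % 4]]


def card_for_number(n_stars, shift=0):
    """Stimulus card with n_stars + shift lines, wrapping within 1..4 (assumed)."""
    n_lines = (n_stars - 1 + shift) % 4 + 1
    return CARD_BY_LINES[n_lines]


# Concept name -> rule taking (number_of_stars, corner) of a response card.
CONCEPTS = {
    "consistent position": lambda n, c: card_for_position(c, 0),
    "clockwise position": lambda n, c: card_for_position(c, 1),
    "counterclockwise position": lambda n, c: card_for_position(c, -1),
    "consistent number": lambda n, c: card_for_number(n, 0),
    "n + 1 number": lambda n, c: card_for_number(n, 1),
    "n - 1 number": lambda n, c: card_for_number(n, -1),
}


def correct_card(concept, n_stars, corner):
    """Return the stimulus card that is correct for a response card."""
    return CONCEPTS[concept](n_stars, corner)
```

Treating each inconsistent concept as a one-step shift makes the relation between the two inconsistent concepts on a dimension explicit: they are shifts in opposite directions on the same cycle.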

Design.—In the present experiment an 
attempt was made to test the previous 
analysis that the facilitative effects of reversal 
shifts are due to the advantages of using the 
same dimension of the stimuli when shifting 
concepts. An attempt was also made to
examine the influence of consistent and
inconsistent responses on reversal and
nonreversal shifts. This was achieved in a
three-dimensional factorial design, in which Ss
learned two successive card-sorting tasks.
During Task 1 all correct responses were
inconsistent with half of the Ss learning
an inconsistent position concept and the
other half learning an inconsistent number
concept.

For Task 2 Ss were divided into eight 
subgroups with half of the Ss learning a 
concept involving the same dimension of the 
stimuli as was previously relevant (reversal 
shifts) and half learning a concept involving 
a different dimension of the stimuli than was 
previously relevant (nonreversal shifts). Half 
of the Ss learned a concept involving consistent
responses and half learned a concept
involving inconsistent responses. Similarly,
half of the Ss learned position concepts and
half learned number concepts. This permitted
a 2 × 2 × 2 factorial design.
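
As a rough sketch of the design just described, the eight Task 2 subgroups can be enumerated from the Task 1 dimension, the type of shift, and the type of Task 2 response. The dictionary keys and the function name are hypothetical labels chosen for this illustration.

```python
from itertools import product

DIMENSIONS = ("position", "number")


def task2_groups():
    """Enumerate the eight subgroups of the 2 x 2 x 2 design."""
    groups = []
    for task1_dim, shift, response in product(
        DIMENSIONS, ("reversal", "nonreversal"), ("inconsistent", "consistent")
    ):
        if shift == "reversal":
            # A reversal shift stays on the dimension relevant in Task 1.
            task2_dim = task1_dim
        else:
            # A nonreversal shift moves to the other dimension.
            task2_dim = next(d for d in DIMENSIONS if d != task1_dim)
        groups.append(
            {
                "task1_response": "inconsistent",  # all Task 1 concepts were inconsistent
                "task1_concept": task1_dim,
                "task2_response": response,
                "task2_concept": task2_dim,
                "shift": shift,
            }
        )
    return groups


groups = task2_groups()
```

Each factor splits the eight cells evenly, which matches the statement that half of the Ss fell at each level of every factor.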

Procedure.—The Ss were tested individually.
The instructions read to S indicated
that as each response card was shown to him 
he should point to the stimulus card he 
thought it belonged with, and that by E telling 
him whether he was right or wrong he would 
gradually find out where each response card 
really belonged. 




The response cards were presented ran- 
domly and individually to each S by being 
placed in the slot which was at the top of the 


card holder. The criterion of learning for 
both Tasks 1 and 2 was 12 successive correct 
responses. All Ss who met the criterion on 
Task 1 within 160 trials were required to 
learn Task 2. 

The change in the pattern of reinforcement 
for the second concept was made without informing
Ss. The Ss who had not learned the
second concept within 500 trials were arbitrarily
assigned a score of 500 for Task 2
learning.
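
The scoring rule can be sketched as follows, assuming each S's record is a list of right/wrong outcomes already limited to the trial ceiling for the task. The function name and the `cap` parameter are hypothetical. The score excludes the criterion run itself, as the Results section states, and a Task 2 S who never reaches criterion receives the cap of 500.

```python
def trials_to_criterion(outcomes, run_length=12, cap=None):
    """Number of trials before the first run of `run_length` correct responses.

    outcomes: iterable of booleans, True for a correct response.
    Returns `cap` when the criterion is never met (None if no cap applies,
    as for Task 1, where such Ss were eliminated).
    """
    streak = 0
    for trial, correct in enumerate(outcomes, start=1):
        streak = streak + 1 if correct else 0
        if streak == run_length:
            # Exclude the criterion trials themselves from the score.
            return trial - run_length
    return cap
```

For example, three errors followed by twelve correct responses give a score of 3.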


In order to avoid partial reinforcement of 
the first concept during learning of the second 
concept (Buss, 1956; Gormezano & Grant, 
1958; Harrow & Friedman, 1958) 4 response 
cards were eliminated from each set of 16 
cards, for each group of nonreversal Ss. This 
left 12 response cards in each deck. The same 
TABLE 1
NUMBER OF TRIALS TO LEARN FIRST CONCEPT (TASK 1)
AND SECOND CONCEPT (TASK 2)

                        Condition
Task 1                    Task 2
Response*   Concept       Response*   Concept     Shift*
Incon.      Position      Incon.      Position    Rev.
Incon.      Position      Incon.      Number      NR
Incon.      Position      Consist.    Position    Rev.
Incon.      Position      Consist.    Number      NR
Incon.      Number        Incon.      Number      Rev.
Incon.      Number        Incon.      Position    NR
Incon.      Number        Consist.    Number      Rev.
Incon.      Number        Consist.    Position    NR

* Incon. = inconsistent; Consist. = consistent;
Rev. = reversal; NR = nonreversal.


4 response cards were also eliminated for the
corresponding reversal groups.


RESULTS 


The data concerned with the learn- 
ing of the first concept are reported in 
Table 1. Both these results and those 
for the second concept represent the 
number of trials to learn the task, 
excluding the 12 criterion trials. To 
test for differences in speed of learning 
between Ss of different groups learn- 
ing the same first task, two separate 
analyses of variance were computed.
One analysis of variance compared 
the four groups initially learning an 
inconsistent position concept and one 
compared the four groups initially 
learning the inconsistent number con- 
cept. Using 3 and 28 df in each case 
for the position and number concepts, 
the overall Fs were, respectively, 0.48 
(P > .05), and 1.36 (P > .05). Thus, 
the data indicate that the four groups 
of Ss learning each type of task were 
equated with each other initially. 

TABLE 1 (continued). Trials to Learn (Mdn, Range, Mean). Range:
13-133, 6-136, 10-127, 40-133, 25-91, 3-44, 3-110, 2-100.

The results for learning the second
concept are also presented in Table 1.
In order to determine whether there
were any differences in the speed of
learning the concepts among the eight
experimental groups, a 2 × 2 × 2
analysis of variance was carried out. 
Due to the skewness of the data, a 
logarithmic transformation was used 
in place of the raw scores to obtain 
homogeneity of variance (Edwards, 
1950). The results, as can be seen in 
Table 2, show that the reversal groups 
learned significantly faster than the 
nonreversal groups (P < .001). Like- 
wise, concepts requiring consistent 
responses were learned significantly 
faster than those requiring inconsistent
responses (P < .001). The significant
interaction (P < .01) between
the reversal-nonreversal and
consistent-inconsistent groups appears to
be due to the much slower learning
of the two nonreversal-inconsistent
groups, as can be seen in Table 1.

TABLE 2
ANALYSIS OF VARIANCE OF LOG TRIALS
TO LEARN SECOND CONCEPT

Source
Reversal-nonreversal (R-NR)
Consistent-inconsistent (C-IC)
Number-position (N-P)
C-IC × R-NR
N-P × R-NR
N-P × C-IC
N-P × R-NR × C-IC
Within groups (MS)

* P < .01
** P < .001

Further breakdown of the reversal-nonreversal
comparison was done by
means of individual t tests. The
transformed scores were used and the
mean square within groups, obtained
from the analysis of variance, was
used as the basis for the error term.
Two-tailed t tests showed that both
kinds of reversal groups learned significantly
faster than the comparable
nonreversal groups. The reversal
groups which shifted concepts within
the same dimension of the stimuli,
without making a literal reversal of
their previous responses (nonliteral
reversals), learned significantly faster
than the comparable nonreversal
groups (t = 2.65, df = 56, P < .02).
The reversal groups which both
shifted concepts within the same dimension
of the stimuli and also made
responses which were literal reversals
of their previous responses, learned
significantly faster than the comparable
nonreversal groups (t = 6.43,
df = 56, P < .001). The two types
of reversal groups did not differ significantly
from each other (t = 1.05,
df = 56, P > .05). There was a significant
difference between the two
kinds of nonreversal groups (t = 4.83,
df = 56, P < .001).
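
A t test that takes its error term from the within-groups mean square of the overall analysis can be sketched as below. This is the standard pooled-error form and is only assumed to match the authors' computation. The function names are hypothetical, and the numbers in the usage note are made up for illustration rather than taken from the paper.

```python
import math


def t_from_pooled_error(mean_a, mean_b, n_a, n_b, ms_within):
    """t for two group means, using the ANOVA within-groups mean square as error."""
    standard_error = math.sqrt(ms_within * (1.0 / n_a + 1.0 / n_b))
    return (mean_a - mean_b) / standard_error


def within_groups_df(n_subjects, n_groups):
    """Degrees of freedom of the within-groups error term."""
    return n_subjects - n_groups
```

With 64 Ss in eight groups, `within_groups_df(64, 8)` gives 56, the df reported for every comparison above. As a purely hypothetical usage, `t_from_pooled_error(2.0, 1.5, 16, 16, 0.25)` compares two sets of 16 log scores against an error mean square of 0.25.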


DISCUSSION 


The quicker learning of reversal shifts
as opposed to nonreversal shifts was
again found in this experiment. It
should be remembered that in the present
experiment the label, reversal group, was
extended to all groups that learned a
second concept which required discrimination
according to a dimension of the
stimuli that was previously relevant
during Task 1 learning. It was found, in


MARTIN HARROW AND ALEXANDER M. BUCHWALD 


all cases, that a second sorting task 
which was based on the same dimension 
of the stimuli as the first task was learned 
quicker than comparable concepts which 
were not based on the same dimension of 
the stimuli. This occurred even when 
the reversal task did not require an exact 
literal reversal of previous sorting re- 
sponses. Thus, the data support the 
previous analysis of Buss (1956), Harrow 
and Friedman (1958), and Kendler and 
D’Amato (1955), who hypothesized that 
a reversal shift is learned quickly because 
it has the advantage of using a dimension 
of the stimuli which was previously 
relevant. In a similar manner the results
fit in with a mediational S-R approach. 
The data also indicate that concepts 
requiring consistent responses are learned 
more quickly than concepts requiring 
inconsistent responses. These results are 
not surprising. It seems probable that Ss 
have frequently made other similar con- 
sistent responses before, in their daily 
lives, and that in the present experiment 
they quickly generalized to this par- 
ticular situation. 
The significant interaction between
reversal-nonreversal and consistent-inconsistent
groups suggests that the
relative difficulty of reversal as opposed 
to nonreversal shifts is affected by 
whether the concepts used require con- 
sistent or inconsistent responses. Due 
to this, when both consistent and 
inconsistent responses are used in experiments
of this type they should be controlled
systematically, whether the interest
is in the consistent and inconsistent
responses (which in themselves have wide 
general applicability) or whether they 
are just used incidentally. In the present 
experiment it seems appropriate to 
analyze the significant interaction term 
with respect to the differences (P < .001) 
between the consistent and inconsistent 
nonreversal groups. Although the literal 
reversals involved inconsistent responses 
and the nonliteral reversals did not, the 
significant interaction appears to reflect 
the significant differences between the 
two types of nonreversal groups. Fitting 
this interpretation, the means of the two 
kinds of reversal groups did not differ 
significantly from each other. It should
also be noted, concerning inconsistent
responses, that the groups required to
make inconsistent responses were more
sensitive to the experimental conditions.
Thus, in similar card-sorting experiments
it may be advisable, when practical, to
use groups making inconsistent responses,
due to their greater sensitivity.


SUMMARY 


The present experiment tested the notion that reversal shifts are
learned more quickly than nonreversal shifts because they involve
responding to a dimension of the stimuli which was used previously.
This was accomplished in a four-category card-sorting situation in
which both number and position concepts were used. Some concepts
required consistent responses and other concepts required
inconsistent responses. Sixty-four Ss learned two successive
card-sorting tasks. During Task 1 all concepts required
inconsistent responses. During Task 2 half of the Ss learned
reversal tasks and half learned nonreversal tasks. Also, half the
concepts used required consistent responses and half required
inconsistent responses.

The results indicated that: (a) All types of reversal tasks (tasks
requiring the use of a dimension of the stimuli which was
previously relevant) were learned in fewer trials than comparable
nonreversal tasks (tasks requiring attention to a different
dimension of the stimuli). Thus the previously reported hypotheses
were supported. (b) Concepts requiring consistent responses were
learned in fewer trials than concepts requiring inconsistent
responses. (c) The relative difficulty of reversal shifts as
opposed to nonreversal shifts is affected by whether consistent or
inconsistent responses are used, i.e., there was a significant
Reversal-Nonreversal × Consistent-Inconsistent interaction.
THE SERIAL POSITION EFFECT OF FREE RECALL


MURDOCK, Jr. 


University of Vermont 


Recently Murdock (1960) has
shown that in free recall R₁, the total
number of words recalled after one
presentation, is a linear function of t,
total presentation time. Nothing was
said about the serial position effect,
though this is a well-known phenomenon
of free recall (e.g., Deese &
Kaufman, 1957). However, given
that there is a serial position effect,
the simple linear relationship between
R₁ and t is rather surprising.

In the customary serial position
curve of free recall, probability of
recall is plotted as a function of serial
position. This means, then, that the
area under the serial position curve is
equal to R₁, the number of words
recalled after one presentation. If R₁
is a linear function of t then it must
follow that the area under the serial
position curve is also a linear function
of t. However, it is not immediately
apparent how the serial position curve
varies with t in such a way as to
maintain this simple linear relationship.
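
The identity between the area under the serial position curve and the mean number of words recalled can be checked on a small made-up recall record. In this sketch each row is one list and each column one serial position, with 1 for a recalled word and 0 otherwise. The function names and the data are hypothetical.

```python
def serial_position_curve(recall_matrix):
    """Proportion of lists on which the word at each serial position was recalled."""
    n_lists = len(recall_matrix)
    n_positions = len(recall_matrix[0])
    return [
        sum(row[position] for row in recall_matrix) / n_lists
        for position in range(n_positions)
    ]


def mean_words_recalled(recall_matrix):
    """Mean number of words recalled per list."""
    return sum(sum(row) for row in recall_matrix) / len(recall_matrix)


# Made-up record: four lists of four words each.
example = [
    [1, 0, 0, 1],
    [1, 1, 0, 1],
    [0, 0, 0, 1],
    [1, 0, 1, 1],
]
curve = serial_position_curve(example)
area = sum(curve)
```

Summing the recall probabilities over positions and averaging the per-list totals are the same double sum taken in a different order, which is why the two quantities agree for any record.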

The present experiment was de- 
signed as an attempt to determine 
how the serial position curve varied 
with list length and presentation rate 
while still maintaining this linear 
relationship. Unfortunately, at the 
end of the experiment it was still not 
clear how this relationship came about 
or, for that matter, whether the rela- 
tionship was even linear after all. 
The basic 


Way as to 


linear rela- 


reason for this failure was 


!This study was supported by a research 
grant, M-3330, from the National Institutes 
of Health. The author would like to thank 
Ellen Lissner, Cynthia Marvin, Frank 
Warhurst for analyzing the position 
data. 


and 
serial 


that the trends which did show up 
were not consistent enough to justify 
any clear-cut conclusions. However, 
a rather definite picture of the serial 
position curve itself did emerge from 
the data. Therefore, the present 
article will be restricted to a quanti- 
tive description and attempted ex- 
planation of the serial position curve 
of free recall. 
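The introduction's point that the area under the serial position curve equals R1 can be illustrated with a short sketch. The recall probabilities below are hypothetical values chosen only for illustration; they are not data from the experiment.

```python
def expected_words_recalled(recall_probabilities):
    """Sum the per-position recall probabilities.

    With one probability per serial position, this sum is the area under
    the serial position curve and equals the expected number of words
    recalled per list (R1).
    """
    return sum(recall_probabilities)


# Hypothetical 10-word curve: modest primacy, flat middle, strong recency.
hypothetical_curve = [0.6, 0.4, 0.3, 0.3, 0.3, 0.3, 0.4, 0.5, 0.7, 0.9]
r1 = expected_words_recalled(hypothetical_curve)
print(f"Expected words recalled per list: {r1:.1f}")
```

Any change in the shape of the curve that leaves this sum a linear function of t would be compatible with the linear relationship discussed above.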


PROCEDURE 


Six groups each had a different combina- 
tion of list length and presentation rate. 
These six combinations were 10-2, 20-1, 15-2, 
30-1, 20-2, and 40-1; the first number indicates 
list length and the second number indicates 
presentation time (in sec.) per item. Thus, 
10-2 means a list of 10 words presented at 
a rate of 2 sec/item. Notice that the first 
two, middle two, and last two groups were 
matched for t, total presentation time (20, 
30, and 40 sec., respectively). 
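The matching of groups on total presentation time follows directly from the condition labels. A minimal sketch, assuming each label is parsed as "list length, then seconds per item" (the function name is illustrative only):

```python
# Condition labels as given in the text: list length, then seconds per item.
CONDITIONS = ["10-2", "20-1", "15-2", "30-1", "20-2", "40-1"]


def total_presentation_time(condition):
    """Return t (in seconds) = list length x presentation time per item."""
    length, seconds_per_item = condition.split("-")
    return int(length) * int(seconds_per_item)


times = {c: total_presentation_time(c) for c in CONDITIONS}
for condition, t in times.items():
    print(f"{condition}: t = {t} sec")
```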

For each group there were 80 different lists. The lists were constructed by randomly selecting words from the (approximately) 4000 most common English words (Thorndike-Lorge, 1944, G count of 20 and up), except that homonyms, contractions, and archaic words were excluded.

Group testing was used. Lists were read to Ss either at every beat (presentation rate of 1 sec/item) or at every other beat (presentation rate of 2 sec/item) of an electric metronome set at a rate of 60 beats/min. After each list there was a recall period of 1.5 min. The Ss wrote down as many words as they could remember in any order that they wished. Each recall period was terminated by a verbal "Ready" signal which preceded the start of the next list by 5-10 sec. All groups were given 20 lists per session and four sessions; successive sessions were spaced 2-7 days apart. Nothing was said about rehearsing while the lists were being presented.

In all there were 103 Ss, students of both sexes from the introductory psychology course who were fulfilling a course requirement. Exact Ns by group are shown in Table 1.
RESULTS


The data were first analyzed to 
determine if practice effects occurred 
over the four sessions. Analyses of 
variance showed that there was a 
significant (P < .01) improvement 
over the four sessions for Groups 10-2, 
15-2, and 20-2; whereas the effect was 
significant at only the .05 level for 
Group 30-1 and was not significant 
(P > .05) for Groups 20-1 and 40-1. 
However, the largest difference ob- 
tained between the best and the worst 
session for any one group was 1.13 
words, and all other intersession dif- 
ferences were less than 1.0 words. 
Therefore, when this practice effect is
spread over four sessions and anywhere
from 10 to 40 serial positions,
its effect on the serial position curves
is negligible.

Table 1 shows the means and SDs of the number of words recalled per list (R1). Each mean is based on 80 lists per S and from 15 to 19 Ss per group. As predicted, groups with the same total presentation time did not differ significantly in mean number of words recalled. That is, no significant differences were found between Groups 10-2 and 20-1 (t = 1.39), between Groups 15-2 and 30-1 (t = 1.00), or Groups 20-2 and 40-1 (t = 0.48).

TABLE 1

MEAN NUMBER OF WORDS RECALLED

Group    Mean    SD

(The group labels and the Mean column are missing from this copy. The SD column reads 0.76, 1.16, 1.40, 1.98, 2.08, 1.08.)
The serial position curves are shown in Fig. 1. Probability of recall is plotted as a function of serial position. For greater generality, we would also like to use the data from studies by Murdock and Babick (1961) and Deese and Kaufman (1957). In the Murdock-Babick study there were 18 Ss each tested on 80 different 25-1 lists. In the Deese-Kaufman study there were two groups of 16 Ss each; one group was tested on 10 different 10-1 lists and the other group was tested on 10 different 32-1 lists. The serial position data were presented in the original article as Fig. 1 (p. 182) and we read the points from the two curves as accurately as possible. These three serial position curves are shown here as Fig. 2.

FIG. 1. Serial position curves for the six groups.

We have, then, nine different serial position curves. In general, the curves seem to share certain characteristics: a marked recency effect, a flat middle section, and a primacy effect which is more precipitous though smaller in magnitude than the recency effect. The presence of a flat middle section, or asymptote, is clearest in the 40-1 list (Fig. 1), but becomes less and less obvious as list length decreases. Actually, in the two 10-word lists the primacy and recency curves may have intersected each other before an asymptote has been reached.


FIG. 2. Serial position curves for 10-1 and 32-1 lists (Deese & Kaufman, 1957) and 25-1 lists (Murdock & Babick, 1961).

More specifically, the recency effect can adequately be described by the Gompertz double-exponential function. As given by Lewis (1960, p. 81) the equation is $y = v g^{h^{x}}$. Probability of nonrecall (y) was plotted as a function of list length minus serial position (x). Thus, the last word in a list would have an x value of 0, the next to last word an x value of 1, etc. Both v and g were fractional and positive. The asymptote v was determined from the mean recall probabilities averaged over the flat part on each serial position curve. The constants g and h were obtained by a least squares method described by Lewis (1960, pp. 82-88) using the last eight points of each serial position curve (except for the two 10-word lists where only the last four or five points could be used).
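To make the Gompertz description concrete, the following sketch evaluates $y = v g^{h^{x}}$ for a few values of x. The constants V, G, and H are hypothetical values picked for illustration; they are not taken from Table 2.

```python
def gompertz_nonrecall(x, v, g, h):
    """Probability of nonrecall, y = v * g**(h**x).

    x is list length minus serial position, so x = 0 is the last word.
    """
    return v * g ** (h ** x)


# Hypothetical constants (assumed for illustration, not from the article).
V, G, H = 0.85, 0.20, 0.55

for x in range(0, 9):
    y = gompertz_nonrecall(x, V, G, H)
    print(f"x = {x}: nonrecall = {y:.3f}, recall = {1 - y:.3f}")
```

With fractional g and h, nonrecall starts at v·g for the last word (x = 0) and rises toward the asymptote v as x increases, which is the recency effect read from the end of the list backward.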

The evidence for this conclusion is
shown in Table 2 under the $r^2$ column.
In all cases the Gompertz equation
accounted for more than 95% of the
variance, and the mean coefficient of
determination ($r^2$) was 97.79%.

Since in all nine cases g < 1/e the 
recency effect is consistently an S shaped 
curve. This characteristic can be seen 
in the serial position curves of Fig. 1 and 
2. Starting from the last serial position, 
each curve is initially positively de- 
celerated and then soon becomes nega- 
tively decelerated. 


TABLE 2

VALUES FOR GOMPERTZ DOUBLE-EXPONENTIAL FUNCTION TO FIT SERIAL POSITION CURVES IN FREE RECALL

(Only fragments of the table body survive in this copy. One nine-entry column, label missing, reads .548, .852, .622, .814, .730, .885, .851, .566, .840. Four entries of the $r^2$ column read 98.8%, 99.3%, 95.9%, 98.7%.)

a From Murdock and Babick (1961).
b From Deese and Kaufman (1957).
The $y_0$ column gives the value of y when x = 0. If $y_0$ is subtracted from 1.00 this gives the probability that the word in the last serial position will be correctly recalled. The results for the six groups of the present experiment were very similar to each other, and an analysis of variance of the number correctly recalled showed that the groups did not differ significantly (F = 1.61, df = 5/97, P > .05). The recall probabilities were rather high but they were not 1.00 (had they been, the Gompertz function would not be applicable); the corresponding recall probabilities for the Murdock-Babick and Deese-Kaufman data were clearly lower.

The inflection point occurs between the second and third words from the end of the list and appears to be essentially independent of list length and presentation rate. The evidence for this conclusion is given under the $x_i$ column of Table 2, where $x_i = -\ln(-\ln g)/\ln h$ (ln is log base e). That is, $x_i$ is the inflection point, that x value at which the deceleration changes from positive to negative. The $x_i$ values range from 0.5 words to 2.1 words with a mean of 1.57 words. Since the last word in any list has an x value of 0, a mean of 1.57 words places the inflection point midway between the second and third words from the end of the list.

Actually, both Deese-Kaufman curves appear to have inflection points nearer the end of the list than any of the other curves. Otherwise, however, the inflection points cluster rather closely in the range of 1.5-2.1 words.

The recency effect extends over the last eight serial positions and appears to be essentially independent of list length and presentation rate. The evidence for this conclusion is given under the $x_{.95}$ column, where $x_{.95}$ is that value of x at which the curve is 95% down. That is, at this point forgetting is 95% of the asymptotic value. The 95% level serves as a convenient criterion to mark the end of the recency effect.

The mean of the $x_{.95}$ column is 6.88 words or, rounded off to the nearest whole number, 7 words. Except for the Deese-Kaufman 10-1 list all the values seem to be very close to 7 words. Since the $x_{.95}$ value is 7 words, the recency effect extends over the last eight serial positions.
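The two landmark values of x can be computed directly from g and h. The sketch below uses the inflection-point formula stated in the text; the formula for the 95% point is an assumed reading of "forgetting is 95% of the asymptotic value" (that is, y = 0.95 v), and the constants are hypothetical rather than taken from Table 2.

```python
import math


def gompertz_nonrecall(x, v, g, h):
    """Probability of nonrecall, y = v * g**(h**x)."""
    return v * g ** (h ** x)


def inflection_point(g, h):
    """x_i = -ln(-ln g) / ln h; positive only when g < 1/e."""
    return -math.log(-math.log(g)) / math.log(h)


def x_95(g, h):
    """Assumed reading: the x at which y reaches 0.95 * v.

    Solving g**(h**x) = 0.95 gives x = ln(ln 0.95 / ln g) / ln h.
    """
    return math.log(math.log(0.95) / math.log(g)) / math.log(h)


# Hypothetical constants (assumed for illustration only).
V, G, H = 0.85, 0.20, 0.55
xi = inflection_point(G, H)
x95 = x_95(G, H)
print(f"inflection point x_i = {xi:.2f}, 95% point = {x95:.2f}")
```

The condition g < 1/e mentioned earlier in the text is what makes the computed inflection point fall at a positive x, so that the S shape is visible within the list.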

Another way of indicating the simi- 
larity among different lists is by the h 
column of Table 2. In the Gompertz the 
constant h determines the rate of change. 
Since the values of h are all rather 
similar this indicates that all curves have 
a similar rate of change, and if they have 
a similar rate of change all curves should 
level out at about the same x value if the 
numerical values of g do not differ too 
greatly. 

The primacy effect appears to extend 
over the first three or four serial posi- 
tions. This can be seen in the serial 
position curves of Fig. 1 and 2, as all of 
the curves seem to level out at about the 
third or fourth serial position. The 
primacy effect is so short-lived that the 
curve is difficult to describe mathe- 
matically. Actually, it may well be 
exponential. Semilog plots of the first 
three or four points of the nine curves 
(using 1.00 - v as the asymptote for each
curve) gave reasonable approximations
to straight lines and the slopes were
rather similar to one another. A group
curve based on the mean (y - c) values of
the individual curves was an excellent 
fit; the rate constant was 0.77 and the 
intercept was .27 (see Murdock & Cook, 
1960). However, the fact that this group 
curve was based on only three points 
should make one hesitant about placing 
too much confidence in it. 

Finally, the primacy and recency effects are spanned by a horizontal asymptote. The asymptote is considered to extend from Serial Position 5 up to the last eight serial positions. That is, in a 20-word list the asymptote would extend from Serial Position 5 through Serial Position 12, in a 30-word list from Serial Position 5 through Serial Position 22, etc. That the asymptote is essentially horizontal is suggested by the middle parts of the serial position curves of Fig. 1 and 2.


A close examination of the serial position curves suggests that the trend line may have a small positive slope rather than a zero slope. However, this positive slope could be due to the fact that the recency effect is only 95% down; i.e., 5% of the effect remains to exert an effect on the (allegedly) horizontal asymptote.

The proper test of this conclusion, then, is to determine whether the obtained increment (if any) is greater than the increment attributable to the 5% remaining from the recency effect.

TABLE 3

PREDICTED AND OBTAINED INCREMENTS FOR ASYMPTOTE

(Only one column of values survives in this copy; the group labels and the other column are missing. The surviving values read .100, -.004, .015, .038, .036, .046.)


The following analysis deals only with lists of 20 words or more; the 10 and 15 word lists could not be used because there were too few points. For each of the six lists the obtained increment was found by fitting a least squares regression line to the asymptote, determining its slope, then multiplying the slope by Δx where Δx is the difference between Serial Position 5 and the seventh-from-last serial position (Δx = 8 for the 20-word list, Δx = 13 for the 25-word list, etc.). The expected increment was found by obtaining the predicted y value from the Gompertz equation for the two values of x (Serial Position 5 and seventh-from-last serial position), then subtracting. For each list the constants shown in Table 2 were used. The predicted and obtained increments are shown in Table 3; the difference between the predicted and obtained increments was not statistically significant (t = 0.72, df = 5). Thus, the asymptote does appear to be horizontal, and the slight positive slope to the curve is no greater than would be expected from the tail end of the recency effect.
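The increment comparison described above can be sketched as follows. This is one reading of the procedure, not the author's actual computation: the function names are illustrative, the Gompertz constants are hypothetical, and the predicted increment is taken as the difference in predicted nonrecall between Serial Position 5 (x = L - 5) and the seventh-from-last position (x = 7).

```python
from statistics import mean


def least_squares_slope(xs, ys):
    """Slope of the ordinary least squares line of ys on xs."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    return sxy / sxx


def delta_x(list_length):
    """Distance from Serial Position 5 to the seventh-from-last position."""
    return (list_length - 7) - 5


def obtained_increment(positions, recall_probs, list_length):
    """Fitted slope over the asymptote times delta x."""
    return least_squares_slope(positions, recall_probs) * delta_x(list_length)


def gompertz_nonrecall(x, v, g, h):
    """Probability of nonrecall, y = v * g**(h**x), x counted from list end."""
    return v * g ** (h ** x)


def predicted_increment(list_length, v, g, h):
    """Rise in recall attributable to the residual recency effect."""
    y_at_position_5 = gompertz_nonrecall(list_length - 5, v, g, h)
    y_at_seventh_from_last = gompertz_nonrecall(7, v, g, h)
    return y_at_position_5 - y_at_seventh_from_last


# Hypothetical constants (assumed for illustration only).
print(delta_x(20), delta_x(25))
print(round(predicted_increment(20, 0.85, 0.20, 0.55), 4))
```

Because the Gompertz curve is only about 95% of the way to its asymptote at x = 7, the predicted increment is small but positive, which is the benchmark against which the fitted slope is judged.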


DISCUSSION 


We have presented data to show that 
the serial position curve of free recall is 
characterized by a rather steep (possibly 
exponential) primacy effect, an S shaped 
recency effect, and a horizontal asymp- 
tote extending between the primacy and 
recency effect. An idealized curve for a 
24-word list is shown in Fig. 3. Its equa- 
tion is 


p = 1.00 + .27 


where L is list length and x is Serial 
Position 1, 2, 3,...L. The constants 
for the primacy effect were those of the 
group curve discussed above while the 
constants for the asymptote and the 
recency effect were the mean values of 
the constants given in Table 2 for the six 
lists of the present experiment. 

The curve of Fig. 3 is an empirical curve, not a rational curve. It is an attempt to describe the serial position effect of free recall quantitatively, not explain it. Not only does this empirical curve represent the nine curves of Fig. 1 and 2 quite well, but also it is consistent with several other sets of data. For one, it agrees with serial position curves for 20-1 lists reported by Deese (1957, Fig. 1, p. 580). For another, it agrees well with some unpublished curves culled from several experiments recently reported by Murdock (1960). Finally, the exact same trends are present in some memory-span data reported by Waugh (1960, Fig. 3, p. 75).

FIG. 3. Idealized serial position curve for 24-word list.

However, the empirical curve of Fig. 3 is not in agreement with results reported by Bousfield, Whitmarsh, and Esterson (1958). These authors used 5-, 10-, 20-, and 40-word lists all presented at a rate of 2.5 sec/word, and consistently found the primacy effect more marked than the recency effect. Both Bousfield et al. (1958, pp. 260-261) and Deese (1957, pp. 581-582) suggest that the relatively slow presentation rate may have encouraged rehearsal and thus led to the greater primacy effect. To investigate this possibility, we conducted an additional experiment with 35 Ss using 10 20-2.5 lists. The 20-word length was selected because the curves of Bousfield et al. (1958, Fig. 1, p. 258) seemed to show the most pronounced primacy effect for this length list. As Bousfield et al. (1958) apparently used a somewhat longer recall period we used a 4-min. recall period in this additional experiment; otherwise the procedure was identical with that of the other experiments reported here.

The results of the experiment are shown in Fig. 4. As can be seen, in general the results are quite consistent with the empirical curve of Fig. 1, and in particular the recency effect is more pronounced than the primacy effect. This experiment clearly shows that the results of Bousfield et al. (1958) are not due to the slower presentation rate per se.

Why did Bousfield et al. (1958) find primacy more pronounced than recency? One possibility is their instructions. Twice in their instructions they told Ss that the words were to be recalled "in the order in which they occur in your memory." The stress on order may have given Ss a set to recall the words in the order presented, and Deese (1957) has shown that instructions to Ss are an important variable in determining the shape of the curve. A second possibility is the design used. Bousfield et al. (1958) used a counterbalanced design such that each S had only one list at each length. Thus, in effect each list was (to S) of unknown length, and this fact may have encouraged rehearsal in the order of presentation.

In any event, under the conditions of the present experiment there seems little doubt that the serial position effect of free recall is essentially as depicted in Fig. 3. Of course, as Deese (1957, p. 581) has noted, the serial position curve is sensitive to the introduction of experimental variables. However, it has been found that more items are recalled with free recall than with ordered recall (Deese, 1957; Waugh, 1961), so evidently free recall is the preferred, perhaps even the more basic, method of recalling a list of unrelated words.

Finally, why does the serial position curve of free recall take the shape it does? One possible explanation is in terms of short-term proactive and retroactive inhibition. That is, each word in a list is both preceded by anywhere from 0 to (L - 1) other words and followed by anywhere from (L - 1) to 0 other words. Up to a point, the more preceding words the more short-term PI and the more succeeding words the more short-term RI. The PI and RI effects presumably summate to determine the total inhibitory effects.

If this explanation is correct, recent studies of the short-term retention of individual items should provide an indication of the course of PI and RI to be expected. It has been found that, in short-term memory, PI effects appear to be greatest after about three prior words (Murdock, 1961). This agrees well with the finding that the primacy effect levels out after the first three or four serial positions. In short-term memory RI effects appear to approach an asymptotic value greater than zero (Murdock, 1961; Peterson & Peterson, 1959). This agrees well with the finding of a horizontal asymptote in the serial position curve. Finally, an examination of the RI curve of short-term memory even suggests an S shaped curve (see proportion of correct recalls over different retention intervals, Tables 1 and 3, Murdock, 1961, pp. 619-620). This agrees well with the Gompertz recency effect suggested here. Thus, it would appear that all the main characteristics of the idealized serial position curve shown in Fig. 3 are compatible with the results obtained from the short-term retention of individual items, and these findings lend support to the idea that the serial position curve of free recall is essentially a manifestation of short-term PI and RI effects.


SUMMARY 


This experiment was a study of the serial position effect of free recall. Curves were obtained for 10-2, 20-1, 15-2, 30-1, 20-2, and 40-1 lists, where the first number indicates list length and the second number indicates presentation time per word. On the basis of the available evidence it was concluded that, under the conditions of the present experiment, the serial position curve is characterized by a steep, possibly exponential, primacy effect extending over the first three or four words in the list, an S shaped recency effect extending over the last eight words in the list, and a horizontal asymptote spanning the primacy and recency effect. Finally, it was suggested that the shape of the curve may well result from proactive and retroactive inhibition effects occurring within the list itself.
These experiments began as an attempt to apply the method of magnitude estimation to a continuum for which the stimulus seemed to have no metric scale, only an ordinal scale of grades of sandpaper, or emery cloth. Unexpected discoveries led on to more engaging inquiries. At the outset, both a ratio scale and a category scale of apparent tactual roughness were determined with 12 grades (grits) of emery cloth. The relation between the ratio scale and the category scale was typical of the relation found on prothetic continua. (For these first results, see Stevens, 1961a; for an earlier related study, see Dudek & Baker, 1956.)


The next study was an exercise in which two students, C. S. Harris and J. P. McMahon, asked 12 Os to judge the smoothness of the stimuli instead of the roughness. The ratio scale of smoothness approximated the inverse, or reciprocal, of the ratio scale found for roughness, and the category scale of smoothness was the reverse, or the complement, of the category scale of roughness. These results resemble Torgerson's (1960) findings when he scaled both the apparent lightness and the apparent darkness of gray papers.


In terms of a linear scale of apparent roughness, it turned out that the stimuli used were bunched rather tightly at the low (smooth) end of the continuum, so much so that the two category scales were almost logarithmic functions of the respective ratio scales. Other studies of category scaling suggest that the nearly logarithmic form found here is an accident of the stimulus spacing, and that if an iterative procedure were used to arrive at a "pure" category scale the curve for roughness would be less curved than a logarithmic function (Stevens & Galanter, 1957).

Since the available number of emery cloth grits is limited, it is difficult to determine a pure category scale, but when a sample of grits more uniformly spaced in subjective roughness was used, the form of the category scale changed as predicted: it became much less curved than a logarithmic function when plotted against the ratio scale of subjective roughness. Ten Os judged grits 320, 120, 80, 50, 40, 30, and 24 twice each on a seven point scale. The average judgments were 1.17, 2.17, 3.04, 4.12, 5.17, 6.08, and 6.62. These values determine a line that is straighter than a logarithmic function, a line that is not far from the pure form of the category scale, as is evidenced by the tendency of Os to use each category number approximately equally often (Stevens & Galanter, 1957). It appears, therefore, that roughness behaves like a prothetic continuum, and that the pure category scale is not a logarithmic function of the magnitude scale (Eisler, 1962).

1 This research was supported by Grant G-10716 from the National Science Foundation (Report PAR-266).


From these preliminary studies a surprising fact emerged: magnitude estimations of roughness, and of smoothness, turned out to give fairly straight lines when plotted in log-log coordinates against the grit number of the emery cloths. Another instance, it seems, of the psychophysical power law. (Grit number refers to the number of openings per inch in the screen employed to sift the abrasive particles.)

If apparent roughness and its reciprocal, apparent smoothness, are power functions of particle size, it becomes a challenging task to determine more accurately the exponents involved. In the preliminary experiments, each involving 12 Os, the approximate exponents were -1.5 for roughness and +1.2 for smoothness when measured against grit number. The next problem was to determine which of these two exponents, if either, is more nearly correct in absolute value.


APPARATUS AND PROCEDURE

The stimuli were the twelve grits, 320, 240, 220, 180, 120, 100, 80, 60, 50, 40, 30, and 24. It was assumed that these grits, Tri-M-ite brand, met the published standards for abrasives (Horton, 1957) which allow the grain size to vary with a standard deviation of approximately 20% around the nominal size. Two different sets of cloths were used. To a sufficient approximation for the present purpose, the grit number can be regarded as proportional to the reciprocal of the grain diameter.

The stimuli were presented one at a time to O, who placed his hand through a cloth-screened opening. He stroked the emery cloth twice with his index and middle fingers. Two samples of each of the 12 grits were presented in a different irregular order to each O.

Two experiments used the method of magnitude estimation, and one used cross-modality matching, as follows.

Experiment 1 (with assigned modulus). Grit 100 was presented first and O was told to call it 10. Of the 20 Os, 10 first judged roughness and on a later date judged smoothness. The other 10 reversed the order.

Written instructions were given to each O. When roughness was being judged, the instructions were:


I am going to present a series of surfaces that vary in roughness. Your task is to tell me how rough they feel by assigning numbers to them. The first will be the standard roughness, which we will call 10. Your task is to assign numbers proportional to your subjective impression. Use whatever numbers seem appropriate: fractions, decimals, or whole numbers. For example, if a surface feels 3 times as rough as the standard, say 30; if it feels 1/5 as rough, say 2, etc. Try not to worry about being consistent; try to give the appropriate number to each surface regardless of what you might have called some previous surface. In feeling the surfaces, draw your index and middle fingers twice across each surface as it is presented.

When the task was to judge smoothness, the words "smooth" and "smoothness" were substituted for "rough" and "roughness" in the preceding instructions.


S. S. STEVENS AND JUDITH RICH HARRIS 


Experiment 2 (no assigned modulus).—The 
first stimulus presented was different for 
each O, and, instead of there being a modulus 
called 10, the instruction was to call the first 
stimulus "any number you think appropriate."
Roughness and smoothness were
judged on different days by each of 10 Os. 

Experiment 3 (cross-modality matching). 
The O adjusted the intensity of a band of 
noise (500 to 5000 cps) until its subjective 
magnitude appeared to match the subjective 
magnitude of the roughness (or smoothness) 
of each grade of emery cloth. Two matches 
were made to each of the 12 cloths in an 
irregular order by 10 Os. One group of 10 Os 
matched loudness to roughness; another group 
of 10 Os matched loudness to smoothness. 

The loudness was controlled by a “sone 
potentiometer” (two 2000-ohm potentiome- 
ters, ganged and cascaded). An additional 
attenuator in series enabled E to keep O's 
adjustments of the sone potentiometer more 
or less centered in the usable range. The 
voltage across the PDR-8 earphones was 
measured with a vacuum-tube voltmeter. 


RESULTS 


The geometric means of the results 
of the 20 Os who judged roughness 
and smoothness in terms of an as- 
signed modulus (Grit 100 called 10) 
are shown in Fig. 1. The slopes of the 
two straight lines are equal but of 
opposite sign (+1.4 and —1.4). 
This is what is called for by the 


reciprocal relation between roughness 


and smoothness. On the other hand, 
the points do not always fall close to 
the lines. In general, there is a 
tendency for both functions to be 
slightly concave downward. 
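The slopes quoted above are slopes of straight lines in log-log coordinates, which are the exponents of the corresponding power functions. A minimal sketch of that calculation follows; the grit numbers are those listed under Apparatus and Procedure, while the judgments are synthetic values generated from an assumed exponent of -1.4, not the observers' data.

```python
import math
from statistics import mean

# Grit numbers as listed in the procedure section.
GRITS = [320, 240, 220, 180, 120, 100, 80, 60, 50, 40, 30, 24]


def log_log_slope(stimuli, judgments):
    """Least squares slope of log10(judgment) on log10(stimulus)."""
    xs = [math.log10(s) for s in stimuli]
    ys = [math.log10(j) for j in judgments]
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    return sxy / sxx


# Synthetic roughness estimates: Grit 100 is called 10 and the assumed
# exponent against grit number is -1.4 (illustration only).
synthetic_roughness = [10 * (g / 100) ** -1.4 for g in GRITS]
slope = log_log_slope(GRITS, synthetic_roughness)
print(f"log-log slope = {slope:.2f}")
```

With real judgments the points scatter about the line, and a systematic bow in the residuals would correspond to the slight downward concavity noted in the text.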

It is also of interest that the standard called 10 at the beginning of each run was judged less rough when it was presented again as a stimulus to be judged (geometric mean 6.43). When the task was to judge smoothness, the standard called 10 was later judged more smooth (geometric mean 13.03). A similar adaptation, if that is what it should be called, was also noted in the preliminary experiments.

FIG. 1. The geometric means of the estimations of roughness (triangles) and smoothness (circles) are plotted against grit number in log-log coordinates. (Each point is based on 40 judgments by 20 Os.)


Experiment 2, with no assigned modulus, gave the results shown in Fig. 2. The points (geometric means) lie closer to the straight lines and the slopes are ±1.5, a slightly greater absolute value than in Exp. 1. Allowing each O to choose his own modulus appears to have produced a better result. This added freedom has led Os to give superior results on other occasions when no standard was used (Stevens, 1956). Other things being equal, it is better in experiments with magnitude estimation to dispense with an assigned modulus and to average by taking the geometric means of the judgments. No prior processing of the data is necessary with geometric averaging.
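A short sketch may help show why geometric averaging needs no prior normalization when each O picks his own modulus. The judgments below are hypothetical: two observers using very different number ranges judge the same two surfaces.

```python
import math


def geometric_mean(values):
    """Geometric mean via the mean of the logarithms."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


# Hypothetical judgments: observer 1 uses small numbers (10 vs. 5, a 2:1
# ratio); observer 2 uses large numbers (100 vs. 25, a 4:1 ratio).
surface_a = [10.0, 100.0]
surface_b = [5.0, 25.0]

geometric_ratio = geometric_mean(surface_a) / geometric_mean(surface_b)
arithmetic_ratio = (sum(surface_a) / 2) / (sum(surface_b) / 2)
print(f"geometric: {geometric_ratio:.2f}, arithmetic: {arithmetic_ratio:.2f}")
```

The ratio of geometric means equals the geometric mean of the two observers' own ratios, whatever moduli they chose, whereas the arithmetic ratio is pulled toward the observer who happened to use larger numbers.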

FIG. 2. Similar to Fig. 1, except that in these experiments on roughness (triangles) and smoothness (circles) no standard stimulus was designated by E. (Each point is based on 20 judgments by 10 Os.)

FIG. 3. Sound pressure levels in decibels re 1 microbar produced in the earphones when Os matched the loudness of a noise to the roughness (triangles) and the smoothness (circles) of the 12 grades of emery cloth. (Each point is the decibel average of 20 judgments by 10 Os.) Axes: sound pressure level in decibels against grit number.

The results of matching loudness to roughness and to smoothness are shown in Fig. 3. The decibel averages of the sound pressure levels produced when Os matched loudness to roughness and smoothness produce straight lines when plotted against a logarithmic scale of grit numbers. This is the predicted outcome if the psychophysical power law holds.

As has been repeatedly demonstrated (Stevens, 1959, 1961b) the exponents (log-log slopes) obtained in cross-modality matches should be equal to the ratio between the exponents of the two modalities determined by magnitude estimation. If we take the exponent for loudness vs. sound pressure to be 0.6 (Stevens, 1955), and the exponent for roughness vs. grit diameter to be 1.5, the predicted slope of the matching function in Fig. 3 is 2.5. The measured slope is 2.6. For smoothness, the measured slope is -2.6, as expected. In view of the sources of error and uncertainty in these experiments, it is reassuring that the measured and the predicted exponents in cross-modality matching agree within 4%. The cross-modality matches suggest that the roughness-smoothness exponents may be slightly greater than 1.5. If we could vary roughness to match loudness, presumably the exponent would be still larger (Stevens, 1959) but this complementary experiment would be difficult to execute.
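The predicted matching slope and its agreement with the measured slope can be checked with a few lines of arithmetic. The sketch uses only the exponents and slopes quoted in the text; the function name is illustrative.

```python
def predicted_matching_slope(roughness_exponent, loudness_exponent):
    """Cross-modality slope expected from the two magnitude-estimation exponents."""
    return roughness_exponent / loudness_exponent


predicted = predicted_matching_slope(1.5, 0.6)  # exponents quoted in the text
measured = 2.6                                   # slope read from Fig. 3
percent_difference = 100 * (measured - predicted) / predicted
print(f"predicted = {predicted:.2f}, difference = {percent_difference:.1f}%")
```

Run the other way, a measured slope of 2.6 with a loudness exponent of 0.6 implies a roughness exponent of about 1.56, which is the basis for the remark that the exponent may be slightly greater than 1.5.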


In order to provide a unit and a formula for subjective roughness, it appears reasonable for the time being to take 1.5 as the exponent and to take the apparent roughness of Grit 320 as the subjective unit. If a name for this unit is desired, the term ruk, derived from a root cognate of rough, is suggested. Accordingly, the equation for subjective roughness R in ruks, as a function of grit number G becomes

$R = 5724\,G^{-1.5}$

In terms of the average diameter in millimeters of the abrasive particles, the exponent would be +1.5, and the constant would be 106.5. This latter value is based on the sieve openings in the United States standard series (Horton, 1957). The equation omits the "threshold" constant, because the available stimuli did not permit its evaluation.
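As a check on the ruk equation, the sketch below evaluates $R = 5724\,G^{-1.5}$ at the two ends of the stimulus series. The constant follows from the choice of Grit 320 as the unit, since 320 raised to the 1.5 power is about 5724; the function name is illustrative.

```python
def roughness_in_ruks(grit_number):
    """Subjective roughness R = 5724 * G**-1.5, with Grit 320 as the unit."""
    return 5724 * grit_number ** -1.5


unit_constant = 320 ** 1.5          # constant implied by R(320) = 1 ruk
finest = roughness_in_ruks(320)     # smoothest cloth in the series
coarsest = roughness_in_ruks(24)    # roughest cloth in the series
print(f"320**1.5 = {unit_constant:.1f}")
print(f"Grit 320: {finest:.2f} ruk; Grit 24: {coarsest:.1f} ruks")
```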

The main uncertainties regarding the size of the exponent for roughness stem from two causes, each of which would have an opposite effect. The available range of stimuli was relatively short, about 1.12 log units, a factor that would be expected to increase the measured exponent. On the other hand, the experiments involved the matching of numbers (or of loudnesses) to roughness (or smoothness), never the reverse. Experience has shown that this unbalanced design tends to decrease the measured exponent. Whether the two sources of presumed bias have equal as well as opposite effects cannot be told without further experimental analysis.


DISCUSSION 


Reciprocality.—It is clear from Fig. 
1, 2, and 3 that values obtained in the 
judgment of smoothness are approxi- 
mately the reciprocals of values for 
apparent roughness. It seems likely that 
any continuum can be judged with at 
least fair success in terms of its reciprocal, 
although only a few have been looked 
at from this point of view. In addition 
to the continuum roughness-smoothness, there are data on
lightness-darkness (of surfaces) and loudness-softness (of noises).
One can easily imagine judging the longness or shortness of lines, the
strength or weakness of vibrations, the brightness or dimness of
luminances, the heaviness or lightness of lifted weights, and so on.
The inverting of a continuum is a kind of semantic matter: O is
instructed differently, and he tries to respond with numbers that are
inversely proportional to those he would use for the continuum "right
side up." At least
he tries to report reciprocals provided he 
understands the semantic rule: a surface 
that is twice as rough is half as smooth. 
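
The semantic rule has a simple consequence on log-log coordinates: taking reciprocals flips the sign of the slope and leaves its size unchanged. The Python sketch below checks this with idealized values that follow a power function exactly. The grit numbers are chosen for illustration only and the helper name is hypothetical.

```python
import math


def loglog_slope(xs, ys):
    """Least-squares slope of log10(y) against log10(x)."""
    lx = [math.log10(x) for x in xs]
    ly = [math.log10(y) for y in ys]
    mean_x = sum(lx) / len(lx)
    mean_y = sum(ly) / len(ly)
    num = sum((a - mean_x) * (b - mean_y) for a, b in zip(lx, ly))
    den = sum((a - mean_x) ** 2 for a in lx)
    return num / den


grits = [100, 120, 320]                       # illustrative grit numbers
roughness = [g ** -1.5 for g in grits]        # idealized roughness judgments
smoothness = [1.0 / r for r in roughness]     # "twice as rough is half as smooth"

print(round(loglog_slope(grits, roughness), 3))
print(round(loglog_slope(grits, smoothness), 3))
```

Real judgments scatter around the line, so the measured slopes for roughness and smoothness are only approximately equal and opposite.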

Some continua, like roughness, are 
frequently referred to in terms of the 
inverse aspect, but for many continua O 
would probably find it a little strange to 
be told to judge the reciprocal. Even 
with smoothness, a few Os commented on 
the difficulty of the task. It seemed 
easier and more natural, for example, to 
match loudness to roughness than to 
match loudness to smoothness. 

Even though people may become accustomed to judging in terms of the
inverse aspect, there is a sense in which what may be called the
degree of stimulation is basically different from the degree of its
absence. When he judges the degree of absence of stimulation, O
manages, with fair success, to report the reciprocal of his judgment
of the strength of the sensory magnitude. A related type of report was
tried out by E. C. Poulton, who asked Os to estimate fractional
loudness by varying only the denominator of their report (see Stevens,
1956). As the tones became fainter the numbers became larger in a
manner that produced a nice reciprocal relation to the standard
results of magnitude estimation. There are many ways of asking O to
report the apparent magnitudes of a series of sensory excitations, but
it seems hardly likely that all methods of report that are logically
equivalent will give equally good results.

It is also important to note that some sensory attributes that may be
thought of as opposites are not at all reciprocally related. Warm and
cold are two striking examples (Stevens & Stevens, 1960). The exponent
for warm is 1.6, for cold 1.0. These seem to be two different sense
modalities, not two names for the same continuum, and the stimulus
domains of the two continua do not overlap. Presumably experiments
could be run in which O told how warm or how neutral the stimulus
felt, and the results would follow the reciprocal relation. An
analogous experiment could be run with
O judging cold or neutral, again with the 
expectation of obtaining reciprocal func- 
tions, but the slopes (exponents) would 
differ greatly from the slopes obtained 
with the warm-neutral pair. One must 
distinguish, therefore, between word 
pairs that refer to two different continua 
and word pairs that refer reciprocally to 
a single continuum. A given tempera- 
ture is usually either warm or cold, but a 
given surface is both rough and smooth. 

The stimuli.—The stimuli used in 
these studies were those samples of emery 
cloth that happened to be available in the 
laboratory. They appeared to be in good 
order, and they seemed to have been a 
good set of stimuli, in the sense that they 
gave rise to orderly power functions 
when used in eight different experiments. 
It is noteworthy, however, that, as the 
experiments accumulated, the evidence 
became increasingly clear that there were 
minor peculiarities in the series of grits, 
or perhaps in the manner of their ad- 
hesion to the cloths. It is not usual to 
argue from the results of magnitude 
estimation to a nonuniformity in a manufacturing process, but the
evidence for at least one such "defect" seems clear. The cloth with
Grit 120 was consistently judged too rough relative to Grit 100. This
fact is especially evident in Fig. 2 and 3 and was also clear in the
preliminary experiments.

Under the microscope one can see that 
Grit 120 involves smaller abrasive grains 
than Grit 100, as indeed it should, but 
the two cloths differ in the degree to which the particles appear to
be immersed in the adhesive. The finer particles (Grit 120) appear to
sit higher on the cloth and to present more of their surface to view.
To the touch, the two cloths (100 and 120) feel different in a way
that can best be described as qualitative. The skin catches
occasionally on the finer particles in a way that is not
characteristic of the coarser particles. This catching, due presumably
to a shallower immersion of the particles in the bonding adhesive, may
account for the relatively higher numerical estimations of the
apparent roughness of the finer grit.

Whatever the explanation, it becomes 
clear that particle size is not the only 
variable in a bonded abrasive that 
can affect apparent roughness. Never- 
theless, the samples used in these ex- 
periments seem to have been only 
minimally contaminated by other factors, 
for sensed roughness grows as a relatively 
clean power function of average particle 
diameter. 

SUMMARY 


Preliminary experiments showed that Os can make consistent judgments
of tactual roughness and smoothness. The stimuli were 12 grits of
emery cloth. Magnitude estimations of roughness and smoothness
produced straight lines when plotted (log-log) against grit number.
The exponents of these power functions were determined in two
experiments with magnitude estimation and one with cross-modality
matching against loudness. All three experiments gave results that
were power functions of grit number with exponents in the vicinity of
-1.5 for roughness and +1.5 for smoothness. The cross-modality matches
also confirmed the exponents determined by magnitude estimation.


REFERENCES 


DUDEK, F. J., & BAKER, K. E. The constant-sum method applied to
scaling subjective dimensions. Amer. J. Psychol., 1956, 69, 616-624.

EISLER, H. Empirical test of a model relating magnitude and category
scales. Scand. J. Psychol., 1962, 3, 88-96.

HORTON, H. L. (Ed.). Machinery's handbook. (15th ed.) New York:
Industrial Press, 1957. P. 1459.

STEVENS, J. C., & STEVENS, S. S. Warmth and cold: Dynamics of sensory
intensity. J. exp. Psychol., 1960, 60, 183-192.

STEVENS, S. S. The measurement of loudness. J. Acoust. Soc. Amer.,
1955, 27, 815-829.

STEVENS, S. S. The direct estimation of sensory magnitudes: Loudness.
Amer. J. Psychol., 1956, 69, 1-25.

STEVENS, S. S. Cross-modality validation of subjective scales for
loudness, vibration, and electric shock. J. exp. Psychol., 1959, 57,
201-209.

STEVENS, S. S. To honor Fechner and repeal his law. Science, 1961,
133, 80-86. (a)

STEVENS, S. S. The psychophysics of sensory function. In W. A.
Rosenblith (Ed.), Sensory communication. New York: Wiley, 1961.
Pp. 1-33. (b)

STEVENS, S. S., & GALANTER, E. H. Ratio scales and category scales for
a dozen perceptual continua. J. exp. Psychol., 1957, 54, 377-411.

TORGERSON, W. S. Quantitative judgment scales. In H. Gulliksen and
S. Messick (Eds.), Psychological scaling. New York: Wiley, 1960.


Received October 21, 1961 





Journal of Experimental Psychology 
1962, Vol. 64, No. 5, 495-504 


AN EVALUATION OF THE ACTIVATIONIST HYPOTHESIS 
OF HUMAN VIGILANCE


JACK A. ADAMS AND LAWRENCE R. BOULTER


University of Illinois 


The arousal or activationist hy- 
pothesis of human attentiveness holds 
that the stimuli impinging on S from 
external sources, or acting from within 
him, determine alertness through the 
reticular activating system of the 
brain (Broadbent, 1958; Fiske, 1961; 
Frankmann & Adams, 1962; Lindsley, 1957; Malmo, 1959). Studies of
human vigilance, or long-term attentiveness for occasional signals in
a monitoring task, commonly show a decrement in signal detection as a
function of observation time, and Scott (1957) interprets this
phenomenon as adaptation to the nearly unchanging stimulation.
Frankmann and Adams (1962) have pointed out that vigilance decrement
is not as ubiquitous as is sometimes believed, and that investigators
have failed to ask why decrement is more prominently associated with
simple tasks and is absent or small in complex visual tasks with
multiple stimulus sources. In terms of the activationist hypothesis,
there appear to be special sources of stimulation in complex tasks.

Efforts to use the hypothesis explicitly in a careful predictive sense
for vigilance findings are hampered by an absence of relationships
between the locus, amount, and type of stimulation, and measures of
molar behavior (Adams, Stenson, & Humes, 1961; Frankmann & Adams,
1962). However, with the hypothesis firmly rooted
in studies of the reticular formation, 
there is a number of physiological 
findings that can be used to facilitate 
a search for the missing definitions and 
relationships at the molar level. The 
two experiments reported here reason 
from physiological findings and seek 
to identify sources of stimulation that 
can deter decrement in complex visual 
tasks. One derivation was that a special source of stimulation in
complex tasks can be the proprioceptive stimulation associated with
the head and eye movements of scanning the stimulus array, and it is
based on proprioception collaterals in the reticular formation (Rossi
& Zanchetti, 1957; Samuels, 1959). When the stimulus sources have wide
spatial separation, head movements would be the most prominent source
of proprioceptive stimulation, but even when multiple stimulus sources
are quite close together we can still expect proprioceptive
stimulation from eye movements alone (Cooper, Daniel, & Whitteridge,
1955; Whitteridge, 1960). The changing retinal stimulation as the head
and eyes scan the visual scene is an added source of stimulation.

Another derivation was that response complexity can produce internal
stimulation that should decrease the amount of decrement, and its
interaction with the proprioception and visual variables was included
to illuminate the effects on vigilance of another source of
stimulation in complex tasks. Physiologically, complex responses might
be expected to influence alertness through cortical outputs fed back
to the reticular system (Adams et al., 1961; Lindsley, 1957; Rossi &
Zanchetti, 1957; Samuels, 1959). Adams and Boulter (1960) and Adams,
Stenson, and Humes (1961) have shown that a relatively complex
decision response of four choices associated with each signal
detection will decrease the amount of vigilance decrement.


EXPERIMENT I
Method


Apparatus. Two units of the multiple source vigilance apparatus were
used, with 2 Ss tested at a time. The S sat at a desk and faced a
semicircular track fixed at eye level to the back of the desk. On the
track, and 30 in. from S's eyes, were mounted four small (14 X 2 in.
front surface) digital display boxes that presented a two-digit number
(#s in. high) which was the critical signal S had to detect. Six
different numbers were used. The number was always bright enough for
easy reading, but the brightness level was intentionally low so S had
to orient to a stimulus source to see the signal easily. When the
critical signal came on, S was instructed to report the event as fast
as possible by pressing a detection button located 4 in. from his
fingertips. A timer started at the onset of the signal and stopped
when the detection button was touched, thus giving a response latency
score. This simple mode was called detection responding. Under another
experimental condition called memory responding, which is described in
more detail below, S had a set of six memory buttons in addition to
the detection button. These buttons were each labeled with one of the
numbers and were arrayed in a semicircle around the detection button,
and 4 in. from it. After S pressed the detection button as rapidly as
possible, he further selected and pressed a memory button, also as
rapidly as possible. The latency from the onset of the signal to
pressing the memory button was recorded too. An S responded with his
preferred hand, but he always kept both forearms in an armrest to
standardize the position of the arm and hand. Latency measures such as
these have been found to have wide variability when the starting
position for the hand is not controlled. A microswitch under each
armrest turned on a light at E's panel whenever S removed an arm from
the armrest and, when this occurred, E administered a brief reminder
over an intercom system. An occasional reminder in the practice
session, which preceded criterion sessions, was usually sufficient to
standardize this procedure. All signals and intersignal intervals were
programed and automatically read and timed by a digital tape reader
that fed the two apparatus units simultaneously. Each S was in his own
experimental room, which had normal illumination. The E and his
console were located in another building.

Description of stimulus series. Each session was 2.5 hr. long, and for
scoring purposes was divided into five 30-min. trials. There were
eight signals on a trial, and each signal lasted for 5 sec. The
signals were the numbers 10, 20, 30, 40, 50, and 60, and each number
occurred at least once on a trial. Two numbers appeared at each of the
four stimulus sources on a trial. Intersignal intervals on each trial
were .7, 1.2, 2.0, 3.0, 4.0, 5.0, 6.2, and 7.5 min. Both the
assignment of intersignal intervals, and of signals to the four
stimulus sources, were separately randomized for each trial. By
repeated random sampling, a pool of 10 tapes was generated according
to these rules.
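
The tape-generation rules can be sketched in a few lines of Python. This is only an illustration of the stated constraints, not the authors' procedure. In particular, the text does not say how the two numbers that repeat on a trial were chosen, so drawing them at random is an assumption, and all names are hypothetical.

```python
import random

NUMBERS = [10, 20, 30, 40, 50, 60]                          # critical signals
INTERVALS_MIN = [0.7, 1.2, 2.0, 3.0, 4.0, 5.0, 6.2, 7.5]    # intersignal intervals
N_SOURCES = 4
SIGNALS_PER_SOURCE = 2


def make_trial(rng):
    """Return one trial as a list of (interval_min, source, number) tuples."""
    # Every number appears at least once; the two extra signals are drawn
    # at random (assumption: the source does not specify this step).
    numbers = NUMBERS + [rng.choice(NUMBERS) for _ in range(2)]
    rng.shuffle(numbers)

    # Two signals at each of the four sources, in random order.
    sources = [s for s in range(1, N_SOURCES + 1)
               for _ in range(SIGNALS_PER_SOURCE)]
    rng.shuffle(sources)

    # Each interval is used exactly once per trial, in random order.
    intervals = list(INTERVALS_MIN)
    rng.shuffle(intervals)

    return list(zip(intervals, sources, numbers))


def make_tape(rng, n_trials=5):
    """One session tape: five independently randomized trials."""
    return [make_trial(rng) for _ in range(n_trials)]


# A pool of 10 tapes, seeded here only so the sketch is repeatable.
rng = random.Random(0)
tape_pool = [make_tape(rng) for _ in range(10)]
```

Each trial's intervals sum to 29.6 min., which together with eight 5-sec. signals comes to roughly the 30-min. trial length given in the text.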
Experimental procedures. A major hypothesis was that sustained
vigilance in complex visual tasks is a function of the proprioceptive
and retinal stimulation arising from head and eye movements, and the
manipulation of this class of stimulation was by spatial separation of
the four stimulus sources. The assumption was that the wider the
separation of sources, the greater the head and eye movements, and the
greater the stimulation. Four amounts of spatial separation, specified
in terms of the difference between the two outermost sources, were
18°, 36°, 72°, and 144°. The other two sources were spaced between the
two peripheral ones to give equal separation between all four.
Separations of 18° and 36° placed the four sources in a direct field
of visual view, and they could be scanned essentially with eye
movements alone. The 72° separation required some added head movements
for comfortable scanning, and scanning the 144° separation was not
possible without head movements.
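
The geometry of the four separations can be made concrete with a short Python sketch. It assumes the array is centered on S's straight-ahead line of sight, which the text does not state explicitly; the function name is hypothetical.

```python
def source_angles(separation_deg, n_sources=4):
    """Angular positions (degrees) of equally spaced sources.

    separation_deg is the angle between the two outermost sources.
    Positions are expressed relative to straight ahead (0 degrees),
    assuming a symmetric layout.
    """
    step = separation_deg / (n_sources - 1)
    return [-separation_deg / 2 + i * step for i in range(n_sources)]


for separation in (18, 36, 72, 144):
    print(separation, source_angles(separation))
```

With the 144° separation, for example, adjacent sources are 48° apart and the outermost sources lie 72° to either side, which is consistent with the statement that scanning was not possible without head movements.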
A second major hypothesis was that response complexity is a source of
internal stimulation for high alertness in complex tasks. Two levels
of response complexity were used. One was detection responding, which
was the simplest, that only required S to press the detection button
when a signal was detected. The other was memory responding, which had
much more complexity, and it required S to actively employ his
immediate memory throughout the 2.5 hr. The S always had to remember
the last set of four numbers that appeared at the stimulus sources. As
soon as he detected a new number at a source, his response was to
press first the detection button and then press the memory button
labeled with the number that had appeared at that same source the last
time. This requirement for short-term memory was quite similar to
running memory span (Pollack, Johnson, & Knaff, 1959), and it was
hypothesized that the continuous memory requirement would introduce
greater internal stimulation than detection responding.
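
The memory-responding rule amounts to keeping one remembered number per source and reporting it before overwriting it. The Python sketch below illustrates the rule under the assumption that S already holds a number for every source when the sequence starts; the function name and example values are hypothetical.

```python
def correct_memory_responses(signals, last_seen):
    """Return the correct memory-button number for each signal.

    signals:   list of (source, number) pairs in order of appearance.
    last_seen: dict mapping each source to the number it showed last
               (assumed known before the first signal in the list).
    """
    remembered = dict(last_seen)          # copy so the caller's dict is untouched
    responses = []
    for source, number in signals:
        responses.append(remembered[source])   # report the previous number
        remembered[source] = number            # then update memory for that source
    return responses


start = {1: 10, 2: 20, 3: 30, 4: 40}
sequence = [(2, 50), (2, 60), (1, 30)]
print(correct_memory_responses(sequence, start))
```

Because every new signal replaces one remembered item, S must update the set of four numbers continuously, which is the sense in which the task resembles running memory span.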


Two groups of 15 Ss each were used, with each S participating for five
sessions. A mixed analysis of variance design was used (Lindquist,
1953, p. 292). Response complexity was a between-Ss variable, with one
group having detection responding throughout and another group having
memory responding. Within-Ss variables were five trials and four
spatial separations, and were administered to all Ss of both groups.
Following a practice session with randomly assigned spatial separation
of sources, each S was given four criterion sessions where each
session was a different one of the four degrees of spatial separation.
So that S did not have two identical separation conditions in a row,
the three separation conditions that were not used in the practice
session were equally divided among the Ss for the first criterion
session. The separation condition for the practice session was
assigned to the last criterion session, and the remaining two
separations were randomly assigned to the second and third criterion
sessions. To control for the possibility of some learning of the
characteristics of a particular input tape over five sessions, a tape
from the pool of 10 tapes was assigned randomly to an S for each
session, with the restriction that no tape be used twice for any S.
Each session was on a different day.

Subjects. The 30 Ss were male graduate students who were paid for
participation. They were randomly assigned to groups.
Results

The level of signal detection was high, as might be expected with a
signal that persisted for 5 sec. Both detection and memory responding
had a detection level of 98%.

Figure 1 shows the main findings. Each S's score was the mean latency
of his responses on a trial, and Fig. 1 presents the plot of group
mean latencies as a function of trials. The latencies presented in
Fig. 1 are for response to the detection button. Latencies increased
as spatial separation increased, and memory responding had longer
latencies than detection responding. The differences found between
detection and memory responding were found to be primarily associated
with pressing the detection button, indicating that all recall and
decision delays about which of the six memory buttons to press took
place before the main detection button was pressed. The difference
between the latency to press the detection button and the latency to
press the subsequent memory button is the time to move from the
detection button to the memory button, and the means of these
difference latencies were found to be essentially constant for all
conditions. These latter data have been omitted here.

FIG. 1. Mean response latency as a function of the two types of
response complexity and the four separations of stimulus sources:
Exp. I. (Panel label: averaged over both response conditions.)

TABLE 1

ANALYSIS OF VARIANCE IN EXP. I

Source                          MS        F
Between Ss
  Detection-Memory (C)         95.88
  Error (b)                     3.01
Within Ss
  Trials (A)                    1.23     13.67*
  Spatial separation (B)       20.08     59.06*
  Remaining interaction and Error (w) mean squares, in the order
  printed: 0.05, 0.07, 0.09, 0.34, 0.10

Note. Error (b) used to test C effect; Error1 (w) used for A and
A X C; Error2 (w) used for B and B X C; Error3 (w) used for A X B and
A X B X C.
* Significant.

Of primary interest was the influence of spatial separation and
response complexity on decremental trends, and Fig. 1 shows decrement
to be associated with all conditions. If our hypotheses about the
effects of spatial separation and response complexity on decrement are
sound, significant interactions should be expected. Less decrement
over trials should be found for large spatial separations and for
memory responding. The results of the analysis of variance in Table 1
show that the hypothesized interaction effects were not confirmed. All
main effects, however, were significant.


EXPERIMENT II


The activationist hypothesis does 
not clarify the dimensional character- 
istics of stimulation that influence 
performance (Adams et al., 1961), 
and it could be that our operations in 
Exp. I did not define those particular 
characteristics of stimulation that 
deter vigilance decrement. The nega- 
tive results of Exp. I may have 
resulted from the manipulation of 
amount rather than the variety of stimulation that has been found to
be important by other investigators. Sharpless and Jasper (1956) found
that stimulus variety induced alertness in the cat, and McGrath (1960)
found that visual detection was heightened by varied auditory
stimulation such as music, but not auditory white noise. Experiment II
was performed to see if change in the pattern of head and eye
movements could influence vigilance decrement. The plan of the
experiment was to establish a pattern of visual observing


by procedures analogous to operant 
reinforcement techniques and then, at 
a point in a session, require a change 
in the pattern. 


Method


Apparatus.—The same task was used, but it was modified by mounting a
small but distinctive neon cue light on top of each stimulus source.
In addition to programing critical signals, whose duration was
shortened to 2 sec. in this experiment, the tape input mechanism also
programed the cue lights from source to source at 2-sec. intervals.
The S was instructed always to watch the cue 
lights as they changed from source to source 
because the signal, whenever it occurred, 
would be coincident with an illuminated cue 
light. One way of viewing the modified task 
is in operant reinforcement terms where 
visually following the changing cue lights is a 
response class that is rewarded when S sees a 
number as it first comes on and has the 
success of an early detection. By faithfully 
following the cue lights, S tends to optimize 
his rewards and his performance. In this 
fashion, we sought to shape a pattern of 
observing responses that could be subjected 
to change. 

Experimental procedures.—The 144° spatial 
separation was used for all groups to insure 
an ample observing response. Detection re- 
sponding was used for all groups. One type of 
observing response was repetitive (R) where the cue light moved
regularly, e.g., Sources 1, 2, 3, 4, 3, 2, 1 ... etc., so that S
scanned regularly back and forth. The other type of observing response
was unsystematic (U) where the occurrence of cue lights was random
with the restriction that the relative frequency at a source be the
same as the R condition. Except for duration being reduced to 2 sec.,
the occurrences of critical signals on the input tapes were the same
as in Exp. I. The shorter signal duration was used to place greater
premium on close following of the cue lights because S would soon
become aware that he would stand a chance of missing signals if he did
not follow the cue lights closely. The length of the session was again
2.5 hr., so the shortening of signal duration slightly increased the
length of intersignal intervals.
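
The two cue-light patterns can be illustrated with a short Python sketch. The repetitive sequence follows the back-and-forth example in the text. For the unsystematic sequence, shuffling the repetitive one is just one simple way to keep the relative frequency per source identical, and is an assumption rather than the authors' documented method. The text does not say whether the same source could be cued twice in succession, so this sketch allows it.

```python
import random
from collections import Counter


def repetitive_sequence(n_steps, n_sources=4):
    """Cue-light order for R scanning: 1, 2, 3, 4, 3, 2, 1, 2, ..."""
    cycle = list(range(1, n_sources + 1)) + list(range(n_sources - 1, 1, -1))
    return [cycle[i % len(cycle)] for i in range(n_steps)]


def unsystematic_sequence(n_steps, rng, n_sources=4):
    """Cue-light order for U scanning with the same source frequencies as R."""
    sequence = repetitive_sequence(n_steps, n_sources)
    rng.shuffle(sequence)          # same counts per source, random order
    return sequence


# 2.5 hr. of cue lights changing every 2 sec. gives 4500 steps.
n_steps = int(2.5 * 3600 / 2)
r_seq = repetitive_sequence(n_steps)
u_seq = unsystematic_sequence(n_steps, random.Random(1))

print(r_seq[:7])
print(Counter(r_seq) == Counter(u_seq))
```

Note that in the repetitive pattern the two inner sources are cued twice as often as the two peripheral ones, and the frequency restriction carries that imbalance over to the unsystematic pattern.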

Four independent groups of 15 Ss each were used. All Ss first had a
practice session, and on a subsequent day had a criterion session.
Group RU had R scanning throughout except on Trial 3 of the criterion
session when it was changed to U scanning. Group R was a control group
for Group RU and had R scanning throughout both sessions. If the
change of scanning pattern for Group RU influenced vigilance
decrement, it would be detectable by comparison with Group R which had
no change. As a check on the type of scanning and the effect of its
change, another group, UR, had U scanning except when it was changed
to R on Trial 3 of the criterion session. Group U was the control for
UR, and had U scanning for both sessions.

Subjects.—The 60 Ss were university male undergraduates who were paid
for their participation. Assignment to groups was random.
Results

As in Exp. I, percent detection was high (98%), and the basic score
for S again was the mean of his response latencies on a trial. A plot
of group means as a function of trials is shown in Fig. 2. Group R,
with repetitive visual scanning, had lower mean latencies than Group U
with unsystematic scanning, but both groups had decrement. These
apparent trends were confirmed in a mixed analysis of variance
(Lindquist, 1953, p. 267) where type of scanning (U and R) was the
between-Ss variable, and trials the within-Ss variable. Type of
scanning gave an F ratio (df = 1/28) of 19.65 (error MS = .17) which
was significant at less than the .01 level. Trials had an F
(df = 4/112) of 9.11 (error MS = .009) which also was significant at
less than .01. The interaction between these two main effects gave an
F (df = 4/112) of .44 (error MS = .009) which lacked significance at
the .05 level.
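
The degrees of freedom reported for this mixed analysis follow directly from the design: two groups of 15 Ss and five trials. A minimal Python sketch of the bookkeeping is given below; the function name and dictionary keys are illustrative.

```python
def mixed_anova_df(n_groups, n_per_group, n_trials):
    """Degrees of freedom for one between-Ss and one within-Ss factor."""
    between = n_groups - 1
    error_between = n_groups * (n_per_group - 1)   # Ss within groups
    trials = n_trials - 1
    interaction = between * trials
    error_within = error_between * trials          # trials x Ss within groups
    return {
        "between": between,
        "error_between": error_between,
        "trials": trials,
        "interaction": interaction,
        "error_within": error_within,
    }


df = mixed_anova_df(n_groups=2, n_per_group=15, n_trials=5)
print(df)
```

These values reproduce the reported ratios of 1/28 for type of scanning and 4/112 for trials and for the interaction.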


Trial 3 was the point of variation in scanning for Groups UR and RU,
and Fig. 2 shows that the effect of the change was to shift a Trial 3
mean in the direction of its control group.
FIG. 2. Mean response latency for groups of Exp. II.
The t test for independent measures was used on Trial 2 latencies
between Groups RU and R, and Groups UR and U, to establish
comparability of groups before the point of change. The t ratio for
the mean difference between RU and R was .05 (error MS = .004), and
for the difference between UR and U was 1.64 (error MS = .006). Both
t's lacked significance. Groups RU and U, and Groups UR and R, were
then evaluated with the t test at Trial 3, the point of change. The t
for the difference between RU and U was .86 (error MS = .005), and UR
and R was 1.05 (error MS = .005). Neither of these two t's was
significant at the .05 level. To check if there was a persisting
effect in the trial that followed, comparisons of Groups RU and R, and
Groups UR and U, were made on Trial 4. The t for the difference
between RU and R was .74 (error MS = .005), and the t for the
comparison between UR and U was 1.50 (error MS = .006). Neither t was
significant at the .05 level. For all of these t tests, df = 1/28.


Interpretation of these statistics must be founded on evidence that
the cue lights changed observing behavior. First, cue lights produced
lower mean latencies than when they were not used, presumably because
S was directed where to look and was often reinforced when he looked
there. Detection responding with 144° separation in Exp. I is
essentially the same task as given Groups U and R, but without cue
lights. Group U had almost twice the speed of responding as the
corresponding condition in Exp. I, and Group R was almost three times
faster. A second and more direct basis for inferring that cue lights
influenced the observing response is shown in Fig. 3, where mean
response latency to subsets of stimulus sources is presented. For each
S a latency score was computed as the mean response time on a trial to
signals of the two inner stimulus sources, and similarly another
latency score was computed for signals on a trial for the two
peripheral stimulus sources. Figure 3 shows the trial means for each
group. If the observing response was under close control and S was
systematically following the cue lights, then equal attentiveness to
all sources is expected. Figure 3 shows that the expectation was
realized for Group R. Not only was their performance level high, but
it was the same for both the inner and peripheral stimulus sources.
Less success was achieved by Group U which had performance noticeably
poorer for peripheral sources. The larger excursions of head and eye
movements apparently did not fall under close control when eliciting
cue lights changed unsystematically. Nevertheless, some measure of
overall success was evident for Group U because its mean performance
level was better than that for the corresponding condition of
detection responding and 144° separation in Exp. I.

FIG. 3. For the four groups of Exp. II, mean response latency for the
inner two and the outer two stimulus sources. (Legend: inner stimulus
sources, peripheral stimulus sources; abscissa: trials.)


Group R and Group U each had a 
separate three-way analysis of vari- 
ance performed on their data shown 
in Fig. 3. There was one measure per 
cell, and trials was one variable, inner 
vs. peripheral a second, and Ss a 
third. For Group R the trials vari- 
able had an F ratio (df = 4/56) of 
6.50 (error MS = .014) which was 
significant at less than the .01 level. 
The inner vs. peripheral variable gave 
an F ratio (df = 1/14) of 2.44 (error 
MS = .009) which failed significance 
at .05. The interaction between 
these two main effects also lacked 
significance at the .05 level (F = .70; 
df = 4/56; error MS = .010). Group U had trials significant at less
than .01 (F = 3.76; df = 4/56; error MS = .025) and also the inner vs.
peripheral variable significant at less than .01 (F = 38.58;
df = 1/14; error MS = .053). The interaction between main effects was
significant between the .05 and .10 levels (F = 2.16; df = 4/56;
error MS = .032).

The plots for Groups UR and RU also are shown in Fig. 3. Change in the
patterning of cue lights influenced performance at both inner and
peripheral sources, and in directions consistent with those for
Groups U and R.
DISCUSSION


Molar behaviorism concerns itself with 
S-R operations, and physiology occupies 
itself with either the effect of stimuli on 
internal bodily states or the effect of 
bodily states on overt behavior. Physio- 
logical psychology, on the other hand, 
is directed toward completing the arc of 
lawfulness from stimulus, to under-the- 
skin states, to molar behavior. A more 
substantial network of empirical laws 
can be achieved by coordinating the laws 
of psychology and physiology, and a 
firmer scientific footing is gained. In 
the two experiments reported here, we 
sought to move in this direction by trying 
to link characteristics of the task, prop- 
erties of the reticular formation, and 
overt vigilance behavior. Our results 
were negative. Neither the amount nor 
variety of stimulation induced by mani- 
pulating head and eye movements, nor 
response complexity, influenced vigilance 
decrement as we had predicted from the 
activationist hypothesis. An impressive 
number of facts for vigilance behavior 
casually fit the general framework of the 
activationist hypothesis, but the absence 
of operational definitions for the type of 
stimuli, as well as the characteristics of 
each stimulus class, gives the hypothesis 
little predictive capability for measures 
of molar behavior (Frankmann & Adams, 
1962). Our two experiments suggest that 
the specification of relevant stimuli and 
their properties is hardly straightforward, 
and they emphasize the problems of 
operationally defining variables of the 
hypothesis well enough for it to be 
thoroughly tested and graduated into a 
substantive psychophysiological law. 

There are several plausible reasons 
why our tests of the activationist hy- 
pothesis failed, and they all illustrate the 
difficulties of operational definition that 
must be resolved before the hypothesis 
can be accepted or rejected. Perhaps our 
manipulations did not induce differential 
stimulation of the reticular formation. 
Adams, Stenson, and Humes (1961) 
found that number of stimulus sources 
did not exert a discernible effect on 
vigilance decrement, and they conjec- 
tured that increasing the number of 
stimulus sources might not have pro- 
duced an increment in stimulation at the 
reticular formation. The same reasoning 
might be applied here. Or, perhaps the 
operations we used did not sufficiently 
arouse the reticular formation. We did 
not have direct physiological measure- 
ment of the impact at the reticular 
formation of these stimuli generated by 
head and eye movements, but vigorous 
reticular activation from these operations 
is certainly an expectation from our cur- 
rent physiological knowledge. Also, the 
operations of immediate memory might 
be questioned as a way of inducing 
cortical-centered stimuli, but the task of 
keeping the most recent four numbers in 
mind has a reasonableness about it for a 
central source of activation. Even if our 
selection of stimulus classes is granted, 
we still may not have uncovered mani- 
pulations for the relevant dimension of 
stimuli. Both the memory and the 
visual scanning tasks in Exp. I may have 
become repetitive rather early, and 
decrement could have occurred as a 
function of adaptation to the steady 
state of internal stimuli, as Scott (1957) 
has suggested. Early adaptation may 
have been a factor in Exp. II where we 
explicitly tried to introduce variety in 
our key stimuli. Sustained stimulus 
variety, whatever its operational defini- 
tion might be, may prove to be the key 
to human alertness. 

Possible directions 
for inferring relevant dimensions of stim- 
ulus variety for visual alertness are the 
study by McGrath (1960) where mean- 
ingful audio stimulation such as music 
improved detection, and the studies by 
Adams and Boulter (1960) and Adams, 
Stenson, and Humes (1961) where four- 
choice decision responding improved 
monitoring behavior. McFarland, Hol- 
way, and Hurvich (1°47) found that a 
brief interlude of stretching and con- 
versation with E eliminated decrement 
in a measure of visual threshold. 

The typical vigilance experiment, 
where marked decrements are found in 
prolonged monitoring behavior, differs 
from those presented here in that near 
threshold stimuli occur at a single stim- 
ulus source and percentage of signals 
detected is the measure. However, there 
is nothing that would seem to preclude 
our task with its latency measure as a 
sound vehicle for testing the hypothesis. 
Decrement was uniformly obtained but 
we were unable to control it by mani- 
pulating sources of stimulation. 


SUMMARY 


The activationist hypothesis contends that 
environmental and internal sources of stimula- 
tion, working through the reticular formation 
of the brain, are sources of human alertness. 
Two experiments were performed to identify 
stimulus determinants of vigilance decrement 
in a complex visual monitoring task. The S's 
task was to detect a two-digit number as 
rapidly as possible when it appeared at any 
one of four stimulus sources. Response 
latency was the measure of performance. 
Experiment I sought to manipulate 
response-produced stimulation arising from 
the stimulation induced by head and eye 
movements and immediate memory. Amount 
of head and eye movements was defined in 
terms of the spatial separation of sources, and 
the physiological basis of its stimulation value 
for alertness was hypothesized to be pro- 
prioception and visual collaterals in the 
reticular formation. Immediate memory was 
defined as a requirement to remember the 
four numbers that last appeared at the 
sources and, when a number appeared, to 
respond with respect to the last number that 
had appeared at that source. Immediate 
memory was hypothesized to provide inputs 
to the reticular formation from cortical areas. 
No effects of these variables on vigilance 
decrement were found. Experiment II asked 
if the negative results of Exp. I were related 
to amount rather than variety of stimulation 
being manipulated. Experiment II intro- 
duced variety in head and eye movements by 
training the pattern of visual observing 
responses and then changing the pattern on a 
trial. No effects on vigilance decrement were 
found. Problems of operationally defining 
the activationist hypothesis were discussed. 


THE RELATIVE EFFICIENCY OF SEVERAL TRAINING 
METHODS AS A FUNCTION OF TRANSFER 
TASK COMPLEXITY ¹ 


GEORGE E. BRIGGS AND JAMES C. NAYLOR 


Ohio State University 


In a recent review Naylor (1962) 
has suggested that the relative effi- 
ciency of part- and whole-task train- 
ing schedules is a function of two 
variables: complexity and organization. 
In his taxonomy, Naylor defines 
complexity as a function of the 
information-processing demands im- 
posed on S by each component of the 
task separately, while task organiza- 
tion is specified by the demands placed 
on S by the interactions or inter- 
relations between task components. 
Thus, task complexity is an intra- 
component characteristic, task or- 
ganization an intercomponent char- 
acteristic. 

Naylor (1962) has further sug- 
gested that the efficiency of a whole- 
task training method (as compared to 
a pure-part task method) will increase 
as task complexity is increased, i.e., 
there should be an interaction between 
training method and task complexity 
in terms of transfer performance. 
Therefore, those tasks which place 
greater intradimensional demands 
upon S should be learned best by 
whole practice, whereas those tasks 
having lesser intradimensional task 
requirements should show a decrease 
in the relative efficiency of the whole 
training method. 

¹ This research was carried out in the 
Laboratory of Aviation Psychology and was 
supported by the United States Navy under 
Contract No. N61339-950, sponsored by the 
United States Naval Training Device Center, 
Port Washington, New York. Permission is 
granted for reproduction, translation, pub- 
lication, use, and disposal in whole or in 
part for any purpose of the United States 
Government. 

In tracking tasks, complexity is 
defined by the order of the machine 
dynamics through which S attempts 
to control. Thus, in a multidimen- 
sional tracking task, the dynamics of 
each dimension are what collectively 
determine total task complexity. 
More specifically, Birmingham and 
Taylor (1954) pointed out that an 
increase in dynamics, as from a simple 
positional (such as setting one's 
watch) to rate control (such as 
steering an auto), results in an in- 
crease in the information-processing 
demands placed on S, i.e., he must 
increase the use of derivative in- 
formation in the tracking error signal 
to maintain stability in rate task 
control. In fact, Russell (1951) 
clearly showed that compared to his 
transfer function (an analytic ex- 
pression of his information-processing 
functions) with a positional control, 
S will increase his damping (weighting 
of the first derivative of error in 
determining a control movement) by a 
factor of 25 when controlling through 
rate dynamics. Thus, a more simple 
transfer function is adequate for 
position control tasks, but rate dy- 
namics require rate estimates by S, 
thereby increasing the complexity of 
his task. 

Similarly, a further increase in dy- 
namics from rate control to accelera- 
tion control results in an even more 
complex task dimension, since S now 
must provide a transfer function with 
the equivalent of two differentiations, 
one for each of the integral lags 
present in the control dimension. 
The relationship between information- 
processing demands and control dy- 
namics is discussed in detail by 
Birmingham and Taylor (1954). 

The present study represents an 
investigation of the task complexity 
hypothesis. In addition to whole and 
pure-part task training methods, the 
present study included groups trained 
via either a progressive-part or a 
simplified-whole method. 


METHOD 


Apparatus.—A three-dimensional compen- 
satory tracking system defined the skill task. 
The S observed three center-reading meters 
labeled “heading,” “altitude,” and “yaw,” 
and he was instructed to maintain null 
readings on these meters as much of the time 
as possible. A single input signal was applied 
to all three dimensions in the form of a simple 
sinusoid of .03 cps. In the more complex 
versions of the task (see below) meters show- 
ing “altitude rate” and/or “heading rate” 
were provided. All meters were mounted on a 
single 10.5 X 19 in. panel which was located 
approximately 24 in. from S, who mani- 
pulated a single, three-dimensional control. 
The control device was a two-dimensional 
stick where the three control stick movements 
were left-right (heading), front-back (alti- 
tude), and clockwise-counterclockwise rota- 
tion of the head (yaw). All three control- 
display linkages conformed to population 
stereotypes. 

An EASE analog computer provided the 
task dynamics, as illustrated in Fig. 1. As 
indicated, there were three levels of control 
complexity used in this study. Tracking 
proficiency was measured during each trial in 
terms of integrated absolute tracking error 
(average error) for each of the three dimen- 
sions separately. 

Design.—Two independent variables were 
manipulated in a transfer of training para- 
digm: transfer task complexity (two levels) 
and method of training (four methods). The 
two levels of transfer task complexity were 
Level II and Level III of Fig. 1. The four 
training methods were whole-task training, 
pure-part training, progressive-part training, 
and simplified-whole task training. There 
were two groups of Ss for each of the training 
methods (one for each of the transfer task 
complexity levels). The two whole-task 
groups tracked through Level II and Level III 
dynamics, respectively, for a total of 10 daily 
sessions. The remaining six groups received 
8 sessions on a training task and 2 sessions on 
the final transfer task. For three of the six 
groups the final transfer task involved Level 
II dynamics, while the other three groups 
experienced Level III dynamics upon final 
transfer. 

FIG. 1. System dynamics used to obtain 
various levels of task complexity. 

Pure-part training involved four 35-sec. 
trials per session on each of three separate 
dimensions. Progressive-part training in- 
volved an initial three sessions on a pure-part 
schedule followed by five sessions during each 
of which S practiced four 35-sec. trials on 
each of the three possible pairings of the 
heading, altitude, and yaw dimensions. Of 
the two groups which experienced the sim- 
plified-whole training method, one trained for 
eight sessions on Level I dynamics (see Fig. 1) 
and then transferred to Level II, while the 
other group trained on Level II and trans- 
ferred to Level III. Table 1 summarizes the 
training and transfer conditions. 

TABLE 1 

EXPERIMENTAL DESIGN OF TRAINING 
AND TRANSFER CONDITIONS 

Group   Training method     Training dynamics   Transfer dynamics 
1       Whole               Level III           Level III 
2       Pure-Part           Level III           Level III 
3       Progressive-Part    Level III           Level III 
4       Simplified-Whole    Level II            Level III 
5       Whole               Level II            Level II 
6       Pure-Part           Level II            Level II 
7       Progressive-Part    Level II            Level II 
8       Simplified-Whole    Level I             Level II 

Note.—See Fig. 1 for Levels I, II, and III. 

Subjects and procedure.—A total of 144 
male undergraduates participated in this 
study. Assignment to the eight groups was 
based on randomization procedures with the 
restriction that groups be filled approximately 
equally throughout the data collection period. 
At the end of this period there were 18 Ss 
per group. The Ss were reimbursed $10.00 
for their time. 


There were 12 35-sec. tracking trials during 
each of the 10 sessions. Practice occurred in 
four-trial blocks with 25 sec. rest between 
trials within a block and 1 min. rest between 
blocks. Scoring occurred over the final 30 
sec. of each trial to avoid the initial transients 
in tracking performance; thus, a block of four 
trials served as the unit of data analysis and 
represents a behavioral sample of 2 min. for 
each S. During the 12 trials of the initial 
session, all Ss controlled the task without the 
.03-cps input signal; that input was present 
during all remaining trials. The purpose of 
this procedure was to facilitate the initial 
acquisition of skill. All three levels of task 
dynamics provide inherently unstable sys- 
tems, and thus the system tends to go out of 
control quite rapidly for the skill novice; 
removing the input markedly improved S’s 
chances of “keeping control” over the system 
during the initial training trials as it provided 
a less difficult task. 

All Ss were instructed to keep the meter 
displays representing heading, altitude, and 
yaw at their null position at all times, if 
possible. It was emphasized that control 
over all three task dimensions was required, 
and S was reminded of this if his performance 
indicated that he was paying particular 
attention to one or two displays to the 
detriment of the other dimension(s). The S 
was informed also of the usefulness of the 
rate display (if it were present, see Fig. 1). 

The performance metric, average error, 
is actually the average deviation of the error 
amplitude distribution generated by S during 
each tracking trial. Therefore, the metric 
describes S's intratrial variability in tracking 
accuracy. The score was recorded originally 
in voltage units; however, following data 
collection all scores were transformed to units 
of inches of arc, thereby providing a scale 
matching that of S’s tracking display. 

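The average error score described above can be illustrated with a short sketch. It assumes the error signal is available as discrete samples in volts, whereas the study integrated the signal on an analog computer. The function name, the sampling rate, and the volts-to-inches scale factor are hypothetical and serve only as an illustration.

```python
def average_error(error_volts, sample_rate_hz, trial_sec=35.0,
                  scored_sec=30.0, inches_per_volt=1.0):
    """Mean absolute tracking error over the scored part of one trial.

    error_volts     : sampled error signal for one trial (assumed input)
    sample_rate_hz  : samples per second (assumed; not given in the text)
    inches_per_volt : hypothetical scale factor to inches of arc
    """
    # Skip the initial transient: only the final `scored_sec` are scored.
    skip = int(round((trial_sec - scored_sec) * sample_rate_hz))
    scored = error_volts[skip:]
    if not scored:
        raise ValueError("no samples fall inside the scored window")
    # Average absolute deviation of the error from the null reading.
    mean_abs_volts = sum(abs(e) for e in scored) / len(scored)
    return mean_abs_volts * inches_per_volt


# Illustrative trial: a large 5-sec. transient, then a steady +/-1 V error.
trial = [10.0] * 50 + [1.0, -1.0] * 150   # 350 samples at 10 Hz
score = average_error(trial, sample_rate_hz=10, inches_per_volt=0.5)
```

Because the first 5 sec. are excluded, the large initial transient in the illustrative trial does not enter the score.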


RESULTS 


The results from both the training 
and the transfer sessions are sum- 
marized in Fig. 2 (Groups 1-4) and 
Fig. 3 (Groups 5-8). The tracking 
error scores were combined for all 
three dimensions and averaged for 
each group. In order to provide 
comparable data points, it was neces- 
sary to derive daily averages for the 
training sessions, i.e., the pure-part 
and the progressive-part groups ex- 
perienced all three tracking dimen- 
sions during each of the eight training 
sessions; however, during transfer it 
was possible to derive comparable 
data points for four-trial blocks. The 
use of combined error scores is justified 
here, since the performance functions 
on the three dimensions are quite 
similar in relative rank order and no 
apparent artifacts are introduced by 
dealing with “total task” performance 
in either the training or the transfer 
sessions. 

Training.—Several points are of 
general interest from the training data 
of Fig. 2 and 3. First, as expected, 
tracking performance is best during 
Sessions 1 through 8 for that whole- 
task group, Group 8, which ex- 
perienced the least complex system 
(Level I), and the least proficient 
performance was attained by Group 1 
which experienced the most complex 
whole task (Level III). Both Groups 
4 and 5 trained on the same whole 
task (Level II), and their performance 
levels (a) are not significantly differ- 
ent one from another (P > .05) and 
(b) fall intermediate to those of 
Groups 1 and 8. 

FIG. 2. Training and transfer perform- 
ance levels for Groups 1-4. (Transfer to 
Task Complexity Level III.) 

FIG. 3. Training and transfer perform- 
ance levels for Groups 5-8. (Transfer to 
Task Complexity Level II.) 

It may be recalled that no input 
signal was employed during Session 1 
for any group; thus, these data are 
set apart in Fig. 2 and 3. Also, it 
may be recalled that Groups 3 and 7 
transferred from a pure-part schedule 
to practice on pairs of task dimensions 
on Session 4; therefore, the figures 
show a discontinuity between Sessions 
3 and 4 for Groups 3 and 7. It is 
interesting to note on Session 4 and 
thereafter that Group 3 surpasses the 
whole-task group, Group 1, whereas 
Group 7 never does exceed the per- 
formance of Group 5, even though 
Group 7 experienced only two track- 
ing dimensions while Group 5 was 
confronted with all three dimensions. 
This may well be a result of the very 
simple yaw dynamics present in the 
Level II task (see Fig. 1). 

In addition to the comparable per- 
formance levels of Groups 5 and 7 
late in training, it may be noted from 
Fig. 3 that Groups 6 and 8 achieved 
similar “total task” performance. 
Again, this is probably a result of the 
very simple task dynamics encount- 
ered by Group 8 during training. 
The more complex dynamics experi- 
enced by Groups 1-4 would account, 
therefore, for the clear separation of 
performance levels for these groups 
during the latter training sessions (see 
Fig. 2). 

An analysis of variance was applied 
to the data of Groups 1-3 and Groups 
5-7 on Session 8. Groups 4 and 8 
were not included, since their task 
dynamics differed from those of the 
companion groups. The results of the 
analysis indicated that training meth- 
ods (F = 78.6; df = 2/102), task 
complexity (F = 85.2; df = 1/102), 
and the Methods X Complexity inter- 
action (F = 28.0; df = 2/102) were 
statistically significant at P < .05. 
The interaction, illustrated in Fig. 2 
and 3, shows that the greatest per- 
formance difference between the Level 
II and the Level III tasks occurred 
with the whole-task groups, while the 
progressive-part groups showed less 
influence of the task complexity vari- 
able, and no difference (P > .05) was 
found between Groups 3 and 7. The 
only other group difference which did 
not attain statistical significance at 
P < .05 on Session 8 was that be- 
tween the whole and progressive-part 
groups on the Level II task dynamics 
(Groups 5 and 7 as mentioned earlier). 

Finally, it may be noted from Fig. 2 
and 3 that all groups had approached 
asymptotic performance levels at or 
before transfer. This was especially 
the case for the two part-task practice 
groups (Groups 2 and 6). If any- 
thing, Groups 2 and 6 were over- 
trained in eight sessions. 

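The degrees of freedom reported for the Session 8 analysis of variance can be checked with a small sketch. It assumes a conventional two-factor between-groups layout (three training methods by two complexity levels, 18 Ss per group); the function name and dictionary keys are illustrative and not taken from the study.

```python
def factorial_df(levels_a, levels_b, n_per_cell):
    """Degrees of freedom for a two-factor between-groups design."""
    df_a = levels_a - 1                      # main effect of factor A
    df_b = levels_b - 1                      # main effect of factor B
    df_ab = df_a * df_b                      # A X B interaction
    df_error = levels_a * levels_b * (n_per_cell - 1)  # Ss within cells
    return {"a": df_a, "b": df_b, "ab": df_ab, "error": df_error}


# Three methods (whole, pure-part, progressive-part), two complexity
# levels, 18 Ss per group, as in Groups 1-3 and 5-7.
session8_df = factorial_df(levels_a=3, levels_b=2, n_per_cell=18)
```

Under these assumptions the sketch gives 2, 1, and 2 numerator degrees of freedom for methods, complexity, and their interaction, with 102 for error, which agrees with the df values cited in the text.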
Transfer.—It would appear from 
Fig. 3 that during the initial block of 
transfer trials Groups 6 and 8 per- 
formed at a level inferior to Group 5 
on Session 2. It should be recalled 
that the data point for Group 5 on 
Session 2 is the average for three 
blocks of trials, and when performance 
for Group 5 on the first block of 
Session 2 is compared to that of 
Groups 6 and 8 on the first block of 
Session 9, the result is not quite 
as unexpected. The transfer index 
$[(C_1 - E_1)/(C_1 - C_2)] \times 100$ was used 
for this comparison, where $C_1$ is the 
performance of Group 5 on the first 
block of Session 2, $C_2$ is the perform- 
ance of the same group on the first 
block of Session 9, and $E_1$ is the 
performance of Group 6, 7, or 8 on the 
latter block of trials. This index of 
transfer is particularly appropriate 
when the raw data are error scores, as 
in the present case, and the index 
describes relative improvement in 
performance as a function of training. 
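The transfer index defined above can be written out as a short sketch. The function name and the numerical error scores below are hypothetical and chosen only to show how the index behaves; they are not data from the study.

```python
def transfer_index(c1, c2, e1):
    """Percentage transfer for error scores (lower score = better).

    c1 : control (whole-task) group error early in practice
    c2 : control group error on the first transfer block
    e1 : experimental group error on the first transfer block
    """
    if c1 == c2:
        raise ValueError("control group shows no improvement; index undefined")
    # Share of the control group's improvement matched by the
    # experimental group, expressed as a percentage.
    return (c1 - e1) / (c1 - c2) * 100.0


# Hypothetical error scores in inches, for illustration only.
example = transfer_index(c1=2.0, c2=1.0, e1=1.25)
```

An experimental group that matches the trained control group scores 100%, and one that performs no better than the untrained control group scores 0%.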


The same index was applied to the 
data of Groups 1-4 and the results for 
all groups were: part training 22% 
transfer to Complexity Level II, 31% 
to Complexity Level III; progressive- 
part training 75% transfer to Level 
II, 94% to Level III; and simplified 
training 28% transfer to Level II, 
44% to Level III. One may note that 
(a) Group 3 attained a very high level 
of performance during transfer (94%), 
while Groups 2, 4, 6, and 8 attained 
rather poor relative transfer per- 
formance levels, and (b) transfer 
performance was relatively higher on 
the more complex task than on the 
less complex task. It is important to 
note that these transfer data are 
expressed in relative terms, as in- 
dicated above. If examined in terms 
of absolute transfer performance levels, 
the pattern of results as listed is 
partially reversed. 

This latter point may be illustrated 
by reference to an analysis of variance 
which was performed on the data of 
all eight groups over the entire six 
blocks of transfer trials. The results 
are summarized in Table 2 where it is 
apparent that, as was the case with 
the training data, there are significant 
differences among the training meth- 
ods and between the two levels of task 
complexity. Further, and of direct 
relevance to the above statement on 
absolute transfer performance levels, 
the interaction of training methods by 
task complexity is statistically sig- 
nificant at P < .05. Figure 4 con- 
tains the group averages which define 
this interaction. There it is obvious 
that considerably greater absolute 
differences occurred between the whole- 
task group (Group 1) on the more 
complex transfer task and the groups 
trained by either pure-part or sim- 
plified-whole methods (Groups 2 and 
4) than between the comparable 
groups on the less complex transfer 
task (Group 5 vs. Groups 6 and 8). 
However, again it should be em- 
phasized that relative comparisons 
between control (whole training) 
groups and the experimental groups 
show better transfer performance on 
the more complex task than on the 
less complex task for the latter groups. 

TABLE 2 

ANALYSIS OF VARIANCE AS APPLIED TO THE 
TRANSFER TRIAL BLOCKS OF SESSIONS 9 
AND 10 FOR ALL GROUPS 

Source 

Training methods (M) 
Task complexity (C) 
M X C 
Ss within methods 
Transfer blocks (B) 
B X M 
B X C 
B X C X M 
B X Ss/M 

FIG. 4. Transfer task performance averaged 
over Sessions 9 and 10 for all groups. 
(Ordinate: Average Error (inches); abscissa: 
Transfer Task Complexity.) 

The other statistically significant 
interactions indicated in Table 2 are 
of interest. The Transfer Blocks 
X Training Methods interaction indi- 
cates a difference in the slopes of the 
transfer performance curves for the 
several groups. By reference to Fig. 2 
and 3 it may be seen that the slopes 
for the whole task and progressive- 
part task groups are very slight, 
whereas those for the part and 
simplified-whole groups are rather 
steep. Thus, while initial transfer 
performance for the latter two groups 
was relatively poor, their progress 
following initial transfer was quite 
rapid. 

The Transfer Blocks X Task Com- 
plexity interaction is more difficult to 
see in Fig. 2 and 3. However, it is a 
result primarily of the especially high 
rates of improvement by Groups 2 and 
4 on the more complex task. It fol- 
lows, therefore, that not only was 
relative transfer higher for these 
groups on the more complex task, but 
also their rate of approach to the 
performance level of the control 
group (Group 1) was greater than that 
for the comparable groups on the less 
complex system. 

TABLE 3 

GROUP COMPARISONS OF TRANSFER 
PERFORMANCE ON BLOCK 1 
(UPPER PART) AND ON 
BLOCK 6 (LOWER 
PART) FOR ALL 
GROUPS 

Note.—Cell entries indicate statistical significance at 
P < .05 and at P > .05 (ns). W, P, P-P, and S iden- 
tify whole, part, progressive-part, and simplified-whole 
methods. 

Following the analysis of variance, 
summarized in Table 2, a Duncan 
multiple range test (Duncan, 1955) 
was applied to the data for all groups 
on the first block of transfer trials and 
on the last block. A summary of these 
comparisons is provided in Table 3. 
The upper half of each section of 
Table 3 represents comparisons made 
with the initial block of transfer trials, 
while the lower half is from the last 
block of transfer trials. These results 
show that whereas there were differ- 
ences among groups early in transfer, 
the relatively short transfer experience 
with the whole task eliminated 
all group differences; therefore, the 
effects of training method were quite 
transitory. 


DISCUSSION 


Several findings of the study are of 
particular interest. First, the relative 
transfer indices are higher for all three 
experimental groups on the more com- 
plex task. Using this measure of method 
efficiency, one finds an indication that the 
part-training methods are actually more 
efficient when the task is highly complex, 
a denial of the Naylor hypothesis. 
Second, using absolute transfer perform- 
ance (see Fig. 4), one finds that only the 
progressive-part schedule showed an im- 
provement relative to the whole method. 
These data lead to the conclusion that 
both the pure-part and the simplified- 
whole methods become less efficient 
training techniques for highly complex 
tasks and that the whole and some 
schedule of progressive-part methods 
emerge as more satisfactory training 
techniques as task complexity is in- 
creased. This substantiates the hy- 
pothesis that task complexity and train- 
ing method interact, with whole training 
becoming relatively more efficient as 
complexity is increased. Since the 
validity of the hypothesis therefore de- 
pends upon the measure of transfer that 
is used, it is not possible to accept or 
reject without ambiguity. 

One possible source for this ambiguity 
lies in the second task variable mentioned 
by Naylor (1962). It is likely that an 
increase in component complexity results 
in a corresponding increment in the 
organizational demands of the task. 
With the three-dimensional task used 
here, the primary organizational require- 
ments imposed on S consisted of time- 
sharing demands, i.e., he had to develop 
an efficient pattern of attention alterna- 
tion among the three displays and avoid 
undue emphasis on a single system 
dimension (component). Thus, as task 
complexity is increased, S must co- 
ordinate his responses to the several task 
dimensions more exactly to avoid large 
errors in one or another of the dimensions. 

A comparison illustrating the impor- 
tance of time sharing is that between the 
simplified-whole, the progressive-part, 
and the pure-part training methods. If 
increases in complexity of the system 
components influence task organization, 
then the pure-part method should show 
less effect of task complexity, while the 
progressive-part and the simplified-whole 
methods should be affected to a greater 
extent. This prediction was substanti- 
ated by the data: there is no statistically 
significant difference between Groups 2 
and 6 (pure-part) during the training 
sessions; however, both Groups 3 and 7 
(progressive-part) and Groups 4 and 8 
(simplified-whole) differ markedly (see 
Fig. 2 and 3). 


Perhaps one of the more intriguing 
findings related to time-sharing concerns 
the relationship between the whole and 
the progressive-part groups during trans- 
fer. It would appear that time-sharing 
can be learned quite well by the progres- 
sive-part method, as there were no 
significant differences between the whole 
and progressive-part groups at either 
level on Transfer Block 1 (Table 3). 
Since, at the most, only practice on pairs 
of task dimensions was given in the 
progressive-part method, the acquisition 
of time-sharing skill may not require 
practice on the total task. 

It would appear, then, that the time- 
sharing demands of multidimensional 
control tasks represent a potent variable 
determining the relative efficiency of 
training methods. However, this vari- 
able accounts for only part of the transfer 
results. If time-sharing requirements 
were the only variable of importance, 
then those groups trained via a simpli- 
fied-whole method should have been 
superior to the pure-part groups during 
transfer. It may be noted from Table 3 
that these groups did not differ signifi- 
cantly at P < .05; thus, the apparent 
superiority of Groups 4 and 8 over 
Groups 2 and 6, respectively, in Fig. 2 
and 3 is only apparent—not real. This 
was a completely unexpected finding of 
the present research. It was expected 
on the basis of an earlier analysis by 
Briggs and Waters (1958) that a sim- 
plified-whole training method would 
provide superior transfer performance 
compared to that following pure-part 
training. 

Also, Briggs, Fitts, and Bahrick (1958) 
had previously demonstrated that greater 
amounts of training resulted in greater 
amounts of transfer for a simplified 
training procedure. Since the amount of 
training provided in this study was 
greater than that used in the Briggs et al. 
study, large amounts of transfer for the 
simplified groups were anticipated. 

Therefore, it may be that the benefit 
accrued by the simplified-whole groups 
during training (the acquisition of time- 
sharing skills) was offset in comparison 
with the pure-part groups by their train- 
ing on the specific dynamics of the 
transfer task. Similarity of training and 
transfer tasks emerges, therefore, as a 
second major determinant of transfer 
performance. 

It is concluded that some form of 
progressive-part training will be superior 
to methods such as pure-part and 
simplified-whole for the acquisition of 
skill in a complex, multidimensional task, 
since the progressive-part method utilizes 
a training task of high similarity to the 
transfer task and it also provides an 
opportunity to develop efficient time- 
sharing behavior. 

Finally, attention is directed to the 
fact that practice on the whole task 
during transfer rather quickly eliminated, 
at least statistically, the differential 
effects of training methods (see Table 3). 
Since whole-task training resulted 
in numerically superior performance 
throughout training, the only argument 
for the pure-part or simplified-whole 
methods rests upon the potential savings 
in training time on the whole task itself 
following training. This argument is 
tenable only to the extent that (a) a 
minimum of whole-task training can 
equate the effects of the methods in 
terms of final performance, and (b) dur- 
ing transfer following pure-part or sim- 
plified-whole training the initial pro- 
ficiency of S is able to meet some criterion 
of operating safety, i.e., it is conceivable 
that even after extended training S could 
“lose control” of a complex system and 
thereby violate safety margins for him- 
self and the system. 

The first point above was met in the 
present study by the very rapid progress 
of the pure-part and simplified-whole 
groups during transfer; however, their 
original transfer performance was very 
poor, and the pure-part groups, in 
particular, probably would have been 
judged "dangerous" had their training 
and transfer tasks been in a real-life 
vehicular system. 


SUMMARY 

The relative efficiency of four training 
methods (pure-part, progressive-part, simpli- 
fied-whole, and whole task) in the acquisition 
of skill was investigated in a three-dimensional 
tracking task. Two levels of transfer task 
complexity were used. There were eight daily 
training sessions followed by two transfer 
sessions on the whole task. 

The results showed that training via the 
whole and the progressive-part methods resulted 
in statistically equivalent performance 
during transfer for both levels of transfer 
task complexity. Both the whole and the 
progressive-part methods resulted in transfer 
performance which was statistically superior 
to that of Ss trained via either the pure-part 
or the simplified-whole method. Further, as 
task complexity was increased, the whole and 
the progressive-part training methods increased 
the absolute (but not the relative) 
superiority of transfer performance compared 
to that with pure-part and simplified-whole 
methods. 

Two factors emerged from a logical analysis 
of the results to explain the superiority of the 
whole and the progressive-part methods: 
first, the transfer (whole) tasks required 
rather efficient time-sharing skills and these 
could not be acquired under the pure-part 
method since, by definition, S experiences 
only one dimension of the task at a time under 
this training method, and second, training and 
transfer task similarity was not as high for 
the simplified-whole groups as it was for 
those Ss trained under the progressive-part 
method; thus, even though the former Ss 
could acquire the necessary time-sharing 
skills, they were "penalized" by training on a 
task which bore less similarity to the transfer 
task than was the case for the latter Ss. 

RECRUITMENT, LATENCY, MAGNITUDE, AND 
AMPLITUDE OF THE GSR AS A FUNCTION 
OF INTERSTIMULUS INTERVAL 


WILLIAM F. PROKASY, JAMES T. FAWCETT, AND J. F. HALL 


Pennsylvania State University 


White and Schlosberg (1952) and 
Moeller (1954) report that, within the 
ranges of 0 and 5 sec., the optimum 
interstimulus interval (ISI) in GSR 
conditioning is approximately .5 sec. 
when magnitude of response is employed 
as the dependent variable. 
Bierbaum (1955), in contrast, reports 
a greater response magnitude near 
3 sec. 
While it is not clear why his ISI 
function should diverge from that obtained 
by White and Schlosberg 
(1952) and Moeller (1954), Bierbaum 
(1955) reports, in addition, that recruitment 
(time elapsing between the 
onset of a GSR and its maximum) 
increases with increases in the ISI. 
Unpublished data from our laboratory 
suggest, further, that both GSR 
latency and recruitment modify across 
extinction trials. 

It is the purpose of the present 
study (a) to examine further GSR 
conditioning as a function of ISI and 
(b) to relate the independent variable 
(ISI) to three GSR attributes other 
than magnitude: latency, recruitment, 
and amplitude. The distinction between 
magnitude and amplitude first 
was made by Humphreys (1943) and 
more recently was discussed by Hilgard 
(1951, p. 528). These authors 
employed the word magnitude to refer 
to a mean based upon all trials, including 
those which resulted in no measurable 
response, while they adopted the 
word amplitude to refer to means 
derived only from those trials on 
which a response occurred. 

This study was supported by an NSF 
grant (G-7463) to the senior author. 
The authors gratefully acknowledge the aid 
of Herbert Krauss and Barry Lively 
in transforming the data to log conductance 
and of the staff at the computer 
center of Pennsylvania State University 
for making the IBM 650 available to us. 



METHOD 


Subjects. The Ss were 129 men and 
women enrolled in introductory psychology 
who volunteered with the knowledge that 
shock would be employed. Of these, 23 were 
lost due to equipment malfunction, adaptation 
to shock, or E's error. 

Apparatus. The GSR was measured by 
means of the Fels dermohmeter. The recording 
electrodes were .75-in. zinc electrodes set 
at the base of Plexiglas cups of .125-in. depth. 
A zinc oxide electrode paste filled the cups and 
made contact with S. The cups were placed 
approximately 1 in. apart (center to center) 
on S's left palm with a constant current of 
70 μa. transmitted between them. Resistance 
changes were recorded on an Esterline-Angus 
milliammeter operating at a paper speed of 
24 in. per min. 

The CS, a 76-db. (re .0002 dynes/cm²) 
1000-cps tone, was presented by means of a 
Grason-Stadler twin oscillator through Permoflux 
PDR-8 earphones, and was superimposed 
on a constant background of white noise 
generated through the earphones at 52 db. 
The UCS was a shock administered by an a.c. 
variac through .5-in. copper electrodes taped 
to S's right index finger. CS and UCS 
duration were controlled through Hunter-Brown 
interval timers. 

Procedure. The Ss were assigned, randomly, 
to one of five treatment conditions: 
0-, .5-, 1-, 3-, or 5-sec. ISI. CS durations for 
the five conditions, respectively, were: .2, .5, 
1, 3, and 5 sec. UCS duration was .2 sec. and 
began with the termination of the CS in all 
except Group 0, in which the CS and UCS 
overlapped. 
The S was seated in a sound-shielded room 
(6 × 10 × 6.5 ft.) which was illuminated by a 
75-w. incandescent bulb. After affixing GSR 
and shock electrodes, E indicated that shock 
would be administered in increasing amounts, 
beginning with a barely perceptible level, 
until it reached a level which S judged to be 
"highly annoying, but not painful." The 
mean voltage achieved was 58.7 (SD = 14.5). 
The S was then instructed that he would 
receive either the already experienced shock 
to the finger, a tone through the earphones, 
or both, but that there was no necessary 
relationship between the two events. The 
earphones were positioned and E returned to 
the outer chamber to initiate training trials. 
All Ss received 20 CS-UCS pairings followed 
by 10 test (extinction) trials. The intertrial 
interval was varied unsystematically between 
25 and 35 sec. Final Ns in each group were: 
Group 0, 20; Group .5, 22; Group 1, 22; 
Group 3, 20; and Group 5, 22. 

Response measures. The measure employed 
on each test trial was the log of 
conductance change, as measured from the 
base at CS onset to the point of maximum 
pen deflection following any response initiation 
that occurred between 1 and 5.5 sec. 
after CS onset. To log conductance change 
was added a constant of 9, thus making all 
scores positive with a range of from .549 
to 3. (Only 3 Ss yielded scores below 1, 
and then on only several trials each.) In the 
event that the pen deflection failed to meet the 
criterion of a response, a score of zero was 
recorded. The minimum deflection recorded 
(as measured by a specially constructed 
template) was 25, 50, 100, 250, or 500 ohms, 
depending upon the dermohmeter sensitivity 
setting required to adjust appropriately to the 
extent of S's UCR during shock adjustment. 

Both magnitude and amplitude were 
recorded as the mean of log conductance 
change plus 9 on Test Trial 1 and on each of 
three blocks of three of the subsequent nine 
test trials. Magnitude was obtained by including 
the score of zero on trials in which no 
response occurred. The amplitude index did 
not include scores of zero. For example, if 
S's scores on Test Trials 2, 3, and 4 were 
2.5, 0, and 1.1, mean magnitude for those 
trials would be 1.2 while mean amplitude 
would be 1.8. 

Latency was defined as the time elapsing 
between CS onset and the occurrence of the 
first response in the criterion range, while 
recruitment was defined as the amount of 
time elapsing between response initiation and 
response peak. The units of our template 
were in .03-in. increments which, in conjunction 
with the paper speed of 24 in. per min., 
permitted time measurement to the nearest 
75 msec. 

Fig. 1. Amplitude and magnitude as a 
function of ISI, measured on Test Trial 1. 
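
The magnitude and amplitude indices defined above, and the time resolution of the template, can be illustrated with a short Python sketch. This is only an illustration under stated assumptions: the function names are hypothetical, scores are taken to be log conductance change plus 9 with zero marking a trial without a criterion response, and nothing here is the authors' own scoring procedure.

```python
def magnitude_and_amplitude(scores):
    """Return (magnitude, amplitude) for a list of per-trial scores.

    A score of 0 marks a trial with no criterion response (assumption
    taken from the text). Magnitude averages over all trials; amplitude
    averages only over trials on which a response occurred.
    """
    if not scores:
        raise ValueError("at least one trial score is required")
    magnitude = sum(scores) / len(scores)
    responses = [s for s in scores if s > 0]
    # Amplitude is undefined when S never responded in the block.
    amplitude = sum(responses) / len(responses) if responses else None
    return magnitude, amplitude


def template_time_resolution(increment_in, paper_speed_in_per_min):
    """Seconds of recording spanned by one template increment."""
    inches_per_sec = paper_speed_in_per_min / 60.0
    return increment_in / inches_per_sec


# Worked example from the text: scores of 2.5, 0, and 1.1 on three trials.
mag, amp = magnitude_and_amplitude([2.5, 0, 1.1])
resolution = template_time_resolution(0.03, 24)
```

With the example scores from the text, the sketch gives a magnitude of about 1.2 and an amplitude of about 1.8, and a .03-in. increment at 24 in. per min. corresponds to about .075 sec.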


RESULTS 


First extinction trial. Responses on 
the first extinction trial were employed 
as the most direct index of the 
influence of the ISI variable, as subsequent 
responses necessarily reflected 
the added operation of UCS omission. 
Mean magnitude and amplitude as a 
function of ISI are shown in Fig. 1. 
Amplitude means, based only upon 
those Ss who responded, are obtained 
from Ns of 17, 22, 22, 16, and 17 for 
Groups 0, .5, 1, 3, and 5, respectively. 

Magnitude is greatest at .5 sec. 
and declines with longer ISI intervals, 
the decrease over ISI values of .5 to 5 
sec. being significant at the .05 level 
(F = 3.98, df = 3/82, error MS = .593). 
A similar analysis of amplitude did 
not result in significant differences 
(F = 1.57, df = 3/73, error MS = .119). 

Table 1 summarizes latency and 
recruitment data on Test Trial 1. 
For neither recruitment (F = 1.1, 
df = 4/90, error MS = 1.273) nor 
latency (F = 1.37, df = 4/90, error 
MS = .775) were the differences 
among groups significant, although it 
can be noted that the longest latencies 
occurred in Groups 3 and 5. 

TABLE 1 
LATENCY AND RECRUITMENT MEANS AND SDs 
ON TEST TRIAL 1 AS A FUNCTION OF ISI 
(Columns: Latency (sec.), Recruitment) 

TABLE 2 
MEAN LATENCY, RECRUITMENT, AMPLITUDE, AND MAGNITUDE OVER 
THREE BLOCKS OF THREE TEST TRIALS AS A FUNCTION OF ISI 
(Columns: Latency, Recruitment, Amplitude, Magnitude, each by Trial Block) 

Final nine extinction trials. Data 
of all Ss who responded at least once 
in each of three blocks of three extinction 
trials were employed in the 
analyses of variance of the four 
response measures. Table 2 summarizes 
the means for all groups while 
Table 3 provides a summary of the 
analyses of variance. Significant between-groups 
effects were obtained 
only with the recruitment and latency 
measures, in both instances a larger 
time value being associated with 
longer ISIs. The significant between-Ss 
effect obtained with all measures 
attests to their reliability. 

Both magnitude and latency 
changed across test trial blocks, with 
magnitude decreasing and latency increasing. 
There was a tendency for 
amplitude to decrease, suggesting that 
there was some decrease in the size of a 
response provided that one occurred. 


TABLE 3 
ANALYSES OF VARIANCE OF MAGNITUDE, AMPLITUDE, LATENCY, 
AND RECRUITMENT AS A FUNCTION OF ISI OVER THREE 
BLOCKS OF THREE TEST TRIALS EACH 
(Sources: Between Ss: ISI, Error (b); Within Ss: Trials, ISI × Trials, Error (w)) 
In the above analyses Group 0 had 
only 9 Ss that met the criterion of at 
least one response in each block of 
three test trials. Because of possible 
bias that might enter by such selection 
of Ss, a second analysis of magnitude, 
incorporating data from all Ss, was 
made. Table 4 provides the means 
and SDs while Table 5 summarizes 
the analysis of variance. 

Table 5 reveals, first, that there is a 
significant between-groups effect and, 
second, that response reliability (as 
evidenced by the sharply increased F 
value for between-Ss means) has increased. 
An examination of mean 
magnitude in Table 4 indicates that 
the bulk of the differences between 
groups rests in Group 0; that is, the 
increase in N in Group 0 over the 
prior analysis added sufficient numbers 
of zeros to provide a between-groups 
effect. 

TABLE 4 
MAGNITUDE MEANS AND SDs FOR ALL Ss AS A FUNCTION OF ISI OVER 
THREE BLOCKS OF THREE TEST TRIALS EACH 
(Columns: Test Trial Block, Mean) 
Means as printed: 1.030, 1.960, 1.931, 1.892, 1.925; 1.116, 1.845, 1.915, 1.637, 1.715 

TABLE 5 
ANALYSIS OF VARIANCE OF MAGNITUDE FOR 
ALL Ss AS A FUNCTION OF ISI AND THREE 
BLOCKS OF THREE TEST TRIALS EACH 
(Sources: Between Ss: ISI, Error (b); Within Ss: Trials, ISI × Trials, Error (w)) 

Double responses. Although preceding 
analyses were based on the 
first GSR occurring in the interval 
from 1 to 5.5 sec. after CS onset, there 
were, in Groups 3 and 5, frequent 
instances of a second response with a 
latency range of from 4 to 8.5 sec. 
after CS onset. Figure 2, a tracing 
of one such response from an S in 
Group 5, illustrates the onset of the 
first response, a plateau, and then the 
onset of the second response. In 
Group 5, of 20 Ss who responded, 19 
gave at least one double response. 
In Group 3, of 18 Ss who responded, 
15 gave at least one double response. 
With the exception of 1 S on one trial 
in Group 0, no double responses were 
observed in Groups 0, .5, and 1. 

Fig. 2. A tracing of a double response 
from 1 S in Group 5. (Onset of the 5-sec. 
event marker is temporally coincident with 
the vertical line indicating initiation of R1 and 
R2, but is displaced due to physical location 
of event recorder. R1 and R2 are latencies of, 
respectively, the first and second response.) 
DISCUSSION 


The ISI function obtained with magnitude 
as the dependent variable is 
similar to that obtained by White and 
Schlosberg (1952) and Moeller (1954). 
Within the range tested, results of all 
three studies suggest that magnitude is 
greatest near an ISI of .5 sec., similar 
to the optimum value obtained in 
numerous studies of skeletal response 
conditioning (see Kimble, 1961, pp. 
156f.). That amplitude did not vary as a 
function of ISI suggests that the magnitude 
function is determined in large 
part by whether or not Ss in the different 
ISI conditions responded.² On this basis, 
the distinction between magnitude and 
amplitude made by Humphreys (1943) 
and Hilgard (1951) merits further consideration. 

Two of the three observations on 
latency and recruitment have some 
precedent in the literature: the positive 
relationship between ISI and recruitment 
corroborates Bierbaum's (1955) finding 
and the increase in latency across extinction 
trials is consistent with Pavlov's 
(1927, p. 49) report that the latency of 
conditioned salivation increases during 
extinction trials. We have been unable to find a 
precedent for the positive relationship 
between latency and ISI in autonomic 
conditioning, though such a relationship 
has been obtained with skeletal 
responses (e.g., Boneau, 1958; 
Ebel, 1961). Though data are limited, 
present evidence suggests that, in addition 
to magnitude, the very form of the 
GSR can be brought under the control of 
external stimulating events. Such a 
possibility warrants the continued observation 
of latency and recruitment in 
GSR research. 

Few references have been made to 
second responses in GSR studies. Rodnick 
(1937), with ISIs of 17 and 21 sec., 
found that a second response would 
occur shortly before shock onset, and 
Stewart, Stern, Winokur, and Fredman 
(1961) demonstrated the acquisition of a 
second GSR while employing an ISI of 
7.5 sec. Grings, Lockhart, and Dameron 
(1961) have shown, further, that this 
second response is differentiated more 
rapidly than is the first response in a 
discrimination learning situation in which 
an ISI of 5 sec. is employed. While the 
nature of the second response is not 
understood, it tends to occur prior to 
UCS onset. Whether or not its absence 
from Groups 0, .5, and 1 results because 
it does not occur in these groups or 
because recording speed is too slow to 
separate response rate changes superimposed 
on the first response remains to 
be investigated. 

² Ratios of number of responders to number 
in each group on Test Trial 1 were 17/20, 
22/22, 22/22, 16/20, and 17/22 for, respectively, 
Groups 0, .5, 1, 3, and 5. While there is no 
statistical test entirely adequate available 
to compare these frequencies for the five 
groups simultaneously, a Fisher exact test 
comparing frequency of responders in Group 
.5 with that of Group 3 and of Group 5 
yielded P values of, respectively, .043 and .032 
for frequency disparities as large as those 
obtained. Furthermore, employing the obtained 
order of mean magnitude (from high 
to low) in the five groups as a basis for 
comparison, of the 120 possible ways that the 
ratios could be ordered only 5 would correspond 
as well as or better to the magnitude 
order than that obtained, even if the tied 
ratio between Groups .5 and 1 were counted 
as a reversal. 
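
The Fisher exact comparison of responder frequencies described in the footnote can be sketched in Python as follows. The sketch assumes a one-tailed test with fixed margins, which the text does not state explicitly; the function name is hypothetical and only the standard library is used.

```python
from math import comb


def fisher_one_sided(resp_a, n_a, resp_b, n_b):
    """One-tailed Fisher exact P for a 2 x 2 responder table.

    Returns the probability, with all margins fixed, that group A
    contains at least `resp_a` responders (hypergeometric upper tail).
    """
    total_resp = resp_a + resp_b
    denom = comb(n_a + n_b, total_resp)
    p = 0.0
    for k in range(resp_a, min(n_a, total_resp) + 1):
        in_b = total_resp - k  # responders left over for group B
        if in_b > n_b:
            continue
        p += comb(n_a, k) * comb(n_b, in_b) / denom
    return p


# Group .5 (22 of 22 responders) against Group 3 (16 of 20 responders).
p_half_vs_3 = fisher_one_sided(22, 22, 16, 20)
```

Under the one-tailed assumption, the comparison of Group .5 with Group 3 comes out at about .043, in agreement with the first value reported in the footnote. The same function can be applied to any other pair of groups.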


SUMMARY 


Five groups of Ss trained at CS-UCS 
intervals of 0, .5, 1, 3, or 5 sec. were employed 
in a study of the role of the interstimulus 
interval (ISI) in the conditioning of the GSR. 
All Ss received 20 tone-shock pairings followed 
by 10 tone alone (test) trials. 
Four attributes of the GSR were 
measured: amplitude, 
magnitude, latency, and recruitment. 

The principal findings were: (a) magnitude 
was greatest with an ISI of .5; (b) magnitude, 
but not amplitude, varied with 
ISI; (c) latency and recruitment 
both increased as a function of ISI; (d) 
latency increased across extinction trials; 
and (e) a second response 
was observed frequently in Ss exposed to 
ISIs of 3 and 5 sec. 

REFERENCES 


BIERBAUM, W. B. Temporal aspects in conditioning 
of the GSR. Unpublished doctoral 
dissertation, University of Florida, 1955. 
BONEAU, C. A. The interstimulus interval 
and the latency of the conditioned eyelid 
response. J. exp. Psychol., 1958, 56, 464-471. 
EBEL, H. C. Stable-state behavior and 
reversibility as a function of the interstimulus 
interval variable in classical eyelid 
conditioning. Unpublished master's thesis, 
Pennsylvania State University, 1961. 
GRINGS, W. W., LOCKHART, R. A., & DAMERON, 
L. E. Interstimulus interval as a 
variable in GSR conditioning of mentally 
deficient individuals. Paper read at 
American Psychological Association, New 
York, 1961. 
HILGARD, E. R. Methods and procedures in 
the study of learning. In S. S. Stevens 
(Ed.), Handbook of experimental psychology. 
New York: Wiley, 1951. 
HUMPHREYS, L. G. Measures of strength of 
conditioned eyelid responses. J. gen. 
Psychol., 1943, 29, 101-111. 
KIMBLE, G. A. Hilgard and Marquis' 
conditioning and learning. (2nd ed.) New 
York: Appleton-Century-Crofts, 1961. 
MOELLER, G. The CS-UCS interval in GSR 
conditioning. J. exp. Psychol., 1954, 48, 
162-166. 
PAVLOV, I. Conditioned reflexes. New York: 
Oxford Univer. Press, 1927. 
RODNICK, E. H. Characteristics of delayed 
and trace-conditioned responses. J. exp. 
Psychol., 1937, 20, 409-425. 
STEWART, M. A., STERN, J. A., WINOKUR, 
G., & FREDMAN, S. An analysis of GSR 
conditioning. Psychol. Rev., 1961, 68, 60-67. 
WHITE, C. T., & SCHLOSBERG, H. Degree of 
conditioning of the GSR as a function of the 
period of delay. J. exp. Psychol., 1952, 43, 
357-362. 


(Received October 30 





Journal of Experimental Psychology 
1962, Vol. 64, No. 5, 519-525 


ON-TARGET VERSUS OFF-TARGET INFORMATION AND 
THE ACQUISITION OF TRACKING SKILL¹ 


ALTON C. WILLIAMS² 


Ohio 


Knowledge of results or information 
feedback has been an area of substantial 
research interest since the 
classic study of Thorndike (1927), the 
most recent review for motor skills 
being that of Bilodeau and Bilodeau 
(1961). The present study was concerned 
with a restricted topic within 
the more general area of knowledge of 
results: the influence of augmented 
feedback on the acquisition of skilled 
performance. As the term implies, 
augmented feedback represents knowledge 
of performance in addition to 
that normally present in a skill task 
and it is defined as secondary information 
supplemental to some primary 
feedback signal(s). 

Of the numerous studies on acquisition 
of tracking skill only those by 
Rapparlie (see Bray, 1948, pp. 195-196) 
and by Payne and Hauty (1955) 
have employed augmented feedback 
based on an off-target criterion, the 
majority of Es having activated such 
signals when tracking error was within 
tolerance, i.e., when S was on-target. 
Thus, augmented feedback has been 
utilized in a majority of the research 
as a means of emphasizing "correct" 
responding, a use which seems logical 
in view of the Skinnerian position that 
errors or incorrect responses should 
be minimized in the construction of 
training tasks and programs, at least 
for discrete-verbal skills. However, 
it is an open question whether in 
continuous motor skill tasks error 
information feedback might not be 
more effective than information on 
correct responding, and the present 
study was concerned, therefore, with 
the effects on tracking performance of 
augmented feedback based on an 
off-target criterion relative to that 
based on an on-target criterion. Further, 
a simple off-target criterion 
condition and an off-target condition 
providing directional information were 
compared. 

¹ This research was carried out in the 
Laboratory of Aviation Psychology and was 
supported by the United States Navy under 
Contract No. N61339-836, sponsored by the 
United States Naval Training Device Center, 
Port Washington, New York. Permission is 
granted for reproduction, translation, publication, 
use, and disposal in whole or in part for 


continuous motor 


METHOD 


Apparatus. The skill task was defined by 
the SETA apparatus (Gain & Fitts, 1959) 
which provides S with a simple positional 
tracking task. The S tracked a 6 cpm + 12 
cpm sinusoidal signal via a one-dimensional 
compensatory display of tracking information 
on a 5-in. cathode ray tube. This display 
was noise free and provided S with his primary 
source of feedback information. Augmented 
feedback was provided to the experimental 
groups in the form of auditory clicks delivered 
to headphones worn by S. The clicks occurred 
at the rate of two per sec. when S was 
within or outside (depending upon the experimental 
condition) preset tolerance limits of 
error. The system tolerance limits were 
attained by adjusting two voltage comparators 
to the desired voltage levels such 
that a plate voltage relay was activated 
whenever tracking error fell within the 
critical levels. Activation of the relay either 
closed a circuit from the click generator to the 
headphones or opened that circuit depending 
upon the experimental condition. Further, it 
was possible to activate only the left earphone 
when tracking error was to the left on 
the visual display and only the right earphone 
in the opposite condition. The on-target 
band thus defined represented tracking error 
of rather small amplitudes: in terms of the 
visual display, the on-target band was ±.08 
in. around the fixed target element. 
Performance was scored by electronically 
integrating the absolute value of tracking 
error. This is the analog of the average 
deviation of S's error amplitude distribution 
and will be identified as average error. The 
scores were transformed from voltage units 
to units of inches on the display scale; thus, 
average error as reported here may be interpreted 
as the average deviation of S's tracking 
error amplitude distribution plotted on the 
same scale used in the visual tracking display. 
In addition to average error, E recorded 
the amount of time during each tracking 
trial when tracking error was within the 
tolerance limits. These data were transformed 
to percent time within tolerance 
scores. 
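
The two performance scores described above can be sketched on sampled data as follows. This is a digital approximation offered under stated assumptions: the study integrated the error signal electronically rather than sampling it, so the sampling rate, the function name, and the parameter names are hypothetical. The default 5-sec. skip reflects the procedure of scoring only the final 25 sec. of each 30-sec. trial.

```python
def score_trial(errors_in, sample_rate_hz, tolerance_in=0.08, skip_sec=5.0):
    """Score one tracking trial from sampled error values (inches).

    Returns (average_error, percent_time_within_tolerance), computed
    after discarding the first `skip_sec` seconds of the trial.
    """
    start = int(round(skip_sec * sample_rate_hz))
    scored = errors_in[start:]
    if not scored:
        raise ValueError("no samples remain after the skipped interval")
    # Mean absolute error: discrete analog of the integrated |error|.
    average_error = sum(abs(e) for e in scored) / len(scored)
    # Share of scored samples inside the on-target band.
    on_target = sum(1 for e in scored if abs(e) <= tolerance_in)
    return average_error, 100.0 * on_target / len(scored)


# Toy trial sampled at 2 Hz: 5 sec. of transients, then four scored samples.
samples = [1.0] * 10 + [0.05, -0.05, 0.2, -0.2]
avg_error, twt = score_trial(samples, sample_rate_hz=2)
```

Because average error weights every deviation by its size while percent time within tolerance only counts samples inside the band, the two scores need not rank groups in the same order.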

Subjects and procedure. The Ss were 88 
volunteer male undergraduates. Each S 
participated in five daily 30-min. sessions, 
and none of the Ss had previous experience 
with a laboratory tracking task. 

Twenty-two Ss were assigned via a 
randomization procedure to each of four 
groups. The control group, Group C, did not 
experience augmented feedback at any time 
during the experiment. The three experimental 
groups experienced the auditory 
clicks during the initial three and one-half 
training sessions whereupon augmented feedback 
was withdrawn for the remaining one 
and one-half sessions. Group I received 
augmented feedback whenever tracking error 
was within the preset tolerance limits (see 
above), while Groups O and O-D received 
the signals whenever tracking error exceeded 
the on-target limits. For Groups I and O 
augmented feedback occurred simultaneously 
in both earphones, while for Group O-D the 
left earphone was activated when displayed 
error was to the left of the fixed target reference 
on the visual display, and the right earphone 
was activated when error was to the 
right. 

All tracking trials were of 30-sec. duration 
and were administered in blocks of four trials. 
There were 3 blocks of trials on the first and 
last sessions and 4 blocks in each of the other 
sessions, a total of 18 blocks. The initial 13 
blocks defined the training conditions for the 
experimental groups, while transfer to the 
no-augmented-feedback condition occurred 
over the final 5 blocks of trials. Rest periods 
of 30 sec. were provided between trials within 
a block and 1.5 min. rest occurred between 
blocks. Performance was scored over the 
final 25 sec. of each trial, which permitted the 
initial transients in tracking behavior to 
dampen out prior to scoring. 
The instructions to all Ss stressed the 
importance of maintaining zero error on the 
visual display, and each S was reminded of 
this ultimate goal at the beginning of each 
session. 
RESULTS 
Average error. Figure 1 provides a 
summary of tracking accuracy as defined 
by the average error scores. It 
may be noted that all three experimental 
groups attained proficiency 
levels superior to that of Group C. 
Further, Group O was superior to both 
Groups I and O-D throughout both 
training and transfer, and there was 
considerable overlap between the latter 
two groups during training. It 
appears, then, that augmented feedback 
based on a simple off-target 
criterion (Group O) resulted in performance 
superior to that attained 
either with an on-target criterion 
(Group I) or with an off-target criterion 
which included directional information 
(Group O-D). 


These observations are supported 
by the results of the Mann-Whitney U 
test which was applied to the average 
error data for pairs of groups. The tracking 
error scores were summed for each S 
across all 18 trial blocks for this 
analysis; thus, n1 = n2 = 22 for each 
two-group comparison performed. 
The alternative hypothesis tested here 
was that more Ss in one of a pair of 
groups were superior in tracking proficiency 
than could occur by chance. 
A nonparametric analysis was employed 
since marked heterogeneity of 
variance was found in the data (see 
Fig. 2). The results of the U tests 
are listed in the upper half of Table 1, 
where it may be noted that all group 
comparisons accepted the alternative 
hypothesis at P < .05 except that between 
Groups I and O-D. 
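
For samples of this size the U statistic is commonly referred to the normal distribution. The following Python sketch shows that large-sample approximation; it is an assumption that the tabled values were obtained this way, the function name is hypothetical, and no correction for ties or continuity is applied.

```python
from math import erfc, sqrt


def mann_whitney_z(u, n1, n2):
    """Normal approximation to the Mann-Whitney U statistic.

    Returns (z, two_tailed_p) for an observed U with group sizes
    n1 and n2, without tie or continuity corrections.
    """
    mean_u = n1 * n2 / 2.0
    sd_u = sqrt(n1 * n2 * (n1 + n2 + 1) / 12.0)
    z = (u - mean_u) / sd_u
    # Two-tailed probability from the standard normal distribution.
    p = erfc(abs(z) / sqrt(2.0))
    return z, p


# U = 270 with n1 = n2 = 22, the Group I versus Group O-D comparison in Table 1.
z, p = mann_whitney_z(270, 22, 22)
```

For U = 270 with 22 Ss per group the sketch gives z of about 0.66 and a two-tailed P of about .51, which agrees with the corresponding entries of Table 1.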
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influence on 


Variability.—Augmented 
also exerted an 
variability within groups (intragroup 
variability), and these data are sum- 
marized in Fig. 2. The data plotted 
are intragroup SDs of the average 
error scores for each block of training 
and transfer trials. As with the aver- 
age error data (Fig. 1), the three ex- 
perimental groups are superior to the 
control group in terms of intragroup 
variability. However, here, Group O 
does not enjoy a clear superiority over 
Group O-D, as was the case in Fig. 1, 
but both Groups O and O-D do ex- 
hibit more inter-S homogeneity than 
that attained by Group I. 

These observations are supported 
by the results of U tests which were 
applied to each pairing of groups. It 
was not possible to test the same 
hypothesis as in the case of the aver- 
age error data; the 


instead, alter- 


<ing error for all groups during traini 


inter-S 


g and transtet 


with the 
intragroup SDs was that more intra- 
group SDs for one of a pair of groups 
would be smaller in value than those 
for the other group than could occur 
by chance. For 
18. 


native hypothesis tested 


each comparison 
The results are listed 
in the lower half of Table 1 where it 
may be seen that (a) Groups O and 
Q-D exhibited more intragroup homo- 
geneity than did Group I, and (0d) all 
three experimental groups were su- 
perior in this regard to Group C. It 
may be concluded, then, that aug- 
mented feedback not only results in a 
higher level of performance accuracy 
but also individual differences within 
groups are less than is the case for Ss 
who do not experience these addi- 
tional feedback cues. 

Time within tolerance.—The per- 
centage of time S spent within the 
+ .08-in. tolerance limits (TWT) is 


wy = Ne = 
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summarized for each four-trial block 
in Fig. 3. While the differences 
among experimental groups are not as 
apparent as with the average error 
data, there are statistically significant 
(P < .01) differences both during 
training and transfer (see Table 2). 


Inter-S variability for all groups during training and transfer. 


The pattern of group proficiency
levels on TWT differs in an important
way from that found with the average
error data: in terms of the TWT
scores Group I is significantly superior
to all other groups during training at
P < .05. This rank order for Groups
I and O is just the reverse of that
noted for the average error measure
where Group O was superior, and this
reversal serves to define how Ss of
Groups O and I responded differentially
to the task (see below).


TABLE 1

RESULTS OF THE MANN-WHITNEY U TEST APPLIED TO AVERAGE
ERROR AND INTER-S VARIABILITY DATA

Performance Criterion              Groups Compared *
                                 I/O-D     I/O     O-D/O

Average Error (n1 = n2 = 22)
    U                             270      330      339
    z                             0.66     2.07     2.28
    P                             .510     .038     .022

Variability (n1 = n2 = 18)
    U                              46       63      149
    P                            <.002    <.002    >.10

* For average error: less/more accurate group; for variability: more/less variable group.


TABLE 2

ANALYSIS OF VARIANCE OF PERCENT TIME WITHIN
TOLERANCE DATA FOR TRAINING AND TRANSFER

Source                          df          MS

Training
    Groups (G)                            2741.13
    Ss within groups (Ss/G)                379.54
    Blocks (B)                            2470.55
    B × G                                   20.02
    B × Ss/G                   1008         13.52
Transfer
    Groups (G)                            1314.14
    Ss within groups (Ss/G)                246.16
    Blocks (B)                              17.18
    B × G                                   39.41
    B × Ss/G                                14.20

* P < .05.
** P < .01.
*** P < .001.


Fig. 3. Percent time within tolerance for all groups during training and transfer.

Finally, it may be noted that Group
I actually deteriorated during transfer
while Groups C, O, and O-D either
“held their own” or improved in
terms of TWT. This deterioration
gave rise to the significant Blocks
× Groups interaction in the transfer
analysis of Table 2.


DISCUSSION


The majority of past research in this
area has found superior performance by
groups trained with augmented feedback
compared to no-augmented-feedback
control groups. In this regard, the
present research merely confirms those
previous data. The contribution of the
present study is twofold: first, the use of
an off-target criterion for the activation
of augmented feedback was found to be
superior to an on-target criterion, the
latter being the most common criterion
used in previous research; and second,
for the first time it has been noted here
that not only accuracy but also group
homogeneity of tracking performance can
be improved significantly by the use of
augmented feedback, especially when
based on an off-target criterion.


In addition to these major findings, it 
is of interest to note several other points. 
First, from Fig. 1 and Fig. 3 it may be 
seen that no deterioration in performance 
occurred for either Group O or Group 
O-D at transfer (augmented feedback 
withdrawn), but Group I does exhibit 
some deterioration on Transfer Blocks 
1 and 2 of Fig. 1 and over the entire 
transfer session in Fig. 3. This is logical 
in view of the similarity between the 
training and transfer trials for Groups O 
and O-D, i.e., during training these Ss 
experienced a diminution in number of 
auditory clicks as a result of increasing 
skill. Group I, however, experienced a 
marked change in going from training to 
transfer, i.e., these Ss experienced increasing
amounts of auditory clicks with
training followed by an abrupt change to
no-augmented-feedback at transfer.
These observations were supported by 
statistical analyses of the transfer data: 
Groups O and O-D were superior to 
Group C throughout transfer in terms of 
tracking accuracy (P < .05), but Groups 
I and C did not differ (P > .05) from the 
first transfer trial to the last. 

The reason for the loss of superiority 
by Group I over Group C may be found 
in Fig. 2: upon transfer Group I ex- 
hibited an increase in inter-S variability 
considerably greater than the modest 
increase shown by Groups O and O-D. 
It follows, then, that the transfer performance
deterioration by Group I
involved not only a loss in tracking
accuracy (Fig. 1) but also an increase of
within-group variability (Fig. 2). Since
apparent accuracy deterioration is not
noted for Groups O and O-D, and since
the increase in inter-S variability was
relatively less than that for Group I
during transfer, it follows that an off-target
criterion for the activation of
augmented feedback is to be preferred
as a training variable.

A second point of interest can be 
introduced by again noting from Fig. 1 
that, in terms of average error, Group O 
is superior throughout to Group I. Now, 
the average error metric actually is the 
average deviation (AD) of S’s tracking 
error amplitude distribution, and thus 
the smaller average error for Group O 
indicates that, on the average, those Ss 
generated smaller errors than did Group 
I. However, Group I actually spent 
more time within the ±.08-in. tolerance
limits (see Fig. 3). It follows, then, that 
the shape of the error amplitude dis- 
tribution for Group O must differ from 
that of Group I. 

What might be the forms of the error 
amplitude distributions for Groups O and 
I? The previous data of Bahrick, Fitts,
and Briggs (1957) suggest that if the
distribution of tracking error for Group I
is nonnormal (as in fact is the case for 
the present data), that distribution is 
probably leptokurtic. On this basis, it 
follows that Group O generated a 
bimodal error amplitude distribution, as 
this is the only distribution that could 
result in both smaller AD and smaller 
percent time within tolerance scores for 
Group O compared to Group I. In 
other words, Ss of Group O spent con- 
siderable time tracking closely around 
the two tolerance limit points and made 
relatively few errors of large amplitude, 
while Group I spent more time within the 
tolerance limits but committed occasional 
large tracking errors. 
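
The argument that one group can show both a smaller average deviation (AD) and a smaller percent time within tolerance can be checked with a small numerical sketch. The error samples below are invented for illustration only and are not the experimental data; AD is taken here as the mean absolute error about zero, which is an assumption about how the metric was scored.

```python
TOLERANCE_IN = 0.08  # tolerance half-width in inches, from the text


def average_deviation(errors):
    """Mean absolute tracking error (assumed form of the AD metric)."""
    return sum(abs(e) for e in errors) / len(errors)


def percent_time_within_tolerance(errors, tolerance=TOLERANCE_IN):
    """Percentage of equally spaced error samples inside the tolerance band."""
    within = sum(1 for e in errors if abs(e) <= tolerance)
    return 100.0 * within / len(errors)


# Hypothetical bimodal pattern: errors clustered near the two tolerance
# limits, with no large excursions (the pattern inferred for Group O).
bimodal_like = [0.07, -0.07] * 20 + [0.10, -0.10] * 30

# Hypothetical peaked pattern: mostly very small errors plus occasional
# large ones (the pattern inferred for Group I).
peaked_like = [0.02, -0.02] * 40 + [0.50, -0.50] * 10

for name, sample in [("bimodal", bimodal_like), ("peaked", peaked_like)]:
    print(name,
          round(average_deviation(sample), 3),
          percent_time_within_tolerance(sample))
```

With these invented samples the bimodal pattern has the smaller AD and also the smaller time within tolerance, which is the combination the text attributes to Group O relative to Group I.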

These deductions from the data, then, 
lead one to conclude that tracking be- 
havior with augmented feedback based 
on an off-target criterion differs funda- 
mentally from that when such feedback 
is based on an on-target criterion: in the 
former case large errors are emphasized, 
and S apparently learns rather quickly 
to minimize these occasional lapses in 
tracking accuracy, while in the latter case 
the importance of small errors is em- 
phasized, but apparently S does not 
respond as quickly or as efficiently to 
correct for occasional large tracking
errors. Since minimizing large tracking
errors is a primary task for S early in
training, it follows that augmented feed- 
back based on an off-target criterion 
should be particularly helpful in shaping 
the desired behavior during early ac- 
quisition of a continuous control skill. 
It is problematical whether or not the 
above generalization will hold in discrete- 
verbal learning tasks. However, it is 
suggested that if the element of discovery 
is present in such discrete tasks, some 
criterion for the activation of augmented 
feedback analogous to the off-target
criterion in the continuous case would be
helpful in focusing S’s attention on
response alternatives reasonably close to
the correct alternative.

Finally, it is interesting to note that 
the additional feedback information 
available to Group O-D (off-target direc- 
tion of error) did not provide for a 
performance level superior to that of 
Group O which received augmented
feedback on a more simple, nondirectional
off-target criterion. Two
explanations for the relative
inferiority of Group O-D are suggested.


First, it is possible, of course, that the
addition of a directional cue to the off-target
criterion was not particularly
useful information to Group O-D since
directional relationships between control
and display movements are one of the
most simple aspects of a tracking task to
be learned. Secondly, it is possible that
the information on error direction was
actually disruptive. It may be recalled
that auditory clicks were delivered to the
left earphone when error was to the left
on the visual tracking display. Thus, a
signal in the left earphone indicated that
S should move his control to the right.
While data are lacking on population
stereotypes in such a stimulus-response
task, it is probable that a more compatible
S-R arrangement would be one in
which a signal in, say, the left earphone
indicates a control movement is required
to the left. It is suggested that both of
the above possibilities were responsible
for the inferior performance of Group
O-D relative to that of Group O.


SUMMARY


During training three experimental groups
received augmented feedback (auditory clicks
at the rate of two per sec.) when tracking
accuracy was within (an on-target criterion)
or outside (an off-target criterion) fixed
tolerance limits. During transfer, no augmented
feedback was provided. All three
experimental groups were superior in tracking
accuracy during training to a control group
which did not receive augmented feedback.

Of the three experimental groups, the group
receiving augmented feedback when off-target
was superior to a group which experienced
clicks when on-target. It was
superior also to a group which received clicks
when off-target but differentially according
to the direction of tracking error.

During transfer both off-target groups
remained superior in tracking accuracy to the
control group, but the on-target group and the
control group attained comparable performance.

It follows that augmented feedback based
on a simple off-target criterion was the most
effective training condition. An analysis of
the data suggested that this superiority was a
result of the emphasis an off-target criterion
places on occasional large tracking errors.
The group trained on this condition apparently
learned to reduce such errors more
quickly and efficiently than did the on-target
criterion group.


RESISTANCE TO EXTINCTION AFTER VARYING AMOUNTS
OF DISCRIMINATIVE OR NONDISCRIMINATIVE
INSTRUMENTAL TRAINING ¹


M. R. D'AMATO, DONALD SCHIFF, AND HARRY JAGODA


New York University 


Resistance to extinction and number
of reinforced responses (acquisition
level) have classically been
thought to be monotonically related.
Recently, however, doubt concerning
the monotonicity of the relationship
has been raised in several quarters
(e.g., Birch, Ison, & Sperling, 1960;
Murillo & Capaldi, 1961; Senko,
Champ, & Capaldi, 1961). Although
nonmonotonicity between resistance
to extinction and acquisition level has
been reported in a rather large
number of recent studies, many of
these are of questionable relevance
because either they made use of some
type of an intermittent reinforcement
schedule (e.g., Capaldi, 1957, 1958)
or they employed a discriminative
rather than a nondiscriminative task
(Murillo & Capaldi, 1961), in some
cases “extinction” actually constituting
reversal learning (e.g., Senko et al.,
1961). In certain other studies
reporting nonmonotonicity and not 
subject to either of the preceding 
objections (Lewis & Duncan, 1956, 
1958), one finds other reasons for 
questioning their pertinence for the 
present problem, such as the use of 
human Ss in money payoff situations, 
which bear only the slightest re- 
semblance to the strictly instrumental 
situations in which, with animal Ss, 
the relationship of monotonicity was 
originally established. 

¹ This research was supported by Research
Grant M-2051 from the National Institute 
of Mental Health, National Institutes of 
Health, United States Public Health Service, 
and Grant G-14724 from the National 
Science Foundation. 


Of the several published studies falling 
in the latter category, two have reported 
resistance to extinction and acquisition level 
to be nonmonotonically related. North and 
Stimmel (1960) found that extinction of the 
running response proceeded more rapidly for 
Ss given 90 or 135 rewarded trials in a runway 
than for Ss given 45 reinforced trials. Wilson 
(1958), on the other hand, found no evidence 
of a reduction in resistance to extinction, even 
though he carried (runway) acquisition to 
480 trials. Harris and Nygaard (1961), 
working with thirsty rats in a free operant 
situation, observed the usual monotonic 
relationship between acquisition level and 
resistance to extinction of the bar pressing 
response up to 360 reinforced responses, the 
highest acquisition level they worked with. 
Under rather similar conditions, Margulies 
(1961) reported monotonicity up to 1,000 
trials; and as described in an earlier paper 
(D'Amato & Jagoda, 1962), unpublished 
results from our laboratory suggest persistence 
of the monotonic relationship up to some 
7,000 rewarded bar pressing responses, again 
with thirsty rats as Ss. Finally, in conflict 
with the preceding results, King, Wood, and 
Butcher (1961) recently reported a reduction 
in the resistance to extinction of pigeons 
receiving 600 or 900 reinforced key pecks, 
as compared to Ss permitted 300 rewarded 
responses. 


In summary, the results from 
studies employing a nondiscriminative 
(simple) instrumental response are 
equivocal, though they suggest that 
under certain conditions the monotonicity
assumption holds up to
several thousands of responses.

The present study had three ob- 
jectives. First, all of the cited Skinner 
box studies in which monotonicity was 
obtained employed the thirst drive 
and water reward. It would be of 
some value to demonstrate that monotonicity
also holds in the Skinner box
setting for the hunger drive and food
reward, particularly since all other 
relevant studies have been conducted 
under the latter motive-incentive 
conditions. Second, we wished to 
carry acquisition training beyond the 
level achieved in previously published 
reports, to a maximum of 1,600 
reinforcements. Third, and most 
important, we wished to assess the 
influence of two different training 
procedures, discriminative versus non- 
discriminative instrumental training, 
on the shape of the function relating 
acquisition level and resistance to 
extinction. The hypothesis under 
examination was that the traditional 
monotonic relationship would prevail 
for Ss given nondiscriminative train- 
ing, but not for Ss given discrimina- 
tion training, the latter being expected
to show a significant decline
in resistance to extinction with prolonged
acquisition training.

One plausible rationale for the 
preceding hypothesis is as follows. 
In general, Ss trained on a dis- 
crimination program are essentially 
on an intermittent reinforcement 
schedule (with respect to the experi- 
mental situation) until the discrimina- 
tion is firmly acquired; beyond this 
point their experience is to all intents 
and purposes one of continuous reinforcement
(since they then make few
errors, or respond little in S^Δ).
Prolonged discrimination training,
then, has the effect of carrying Ss
beyond their intermittent reinforcement
experience well into the continuous
reinforcement segment. If
it is assumed that directly following
intermittent reinforcement experience
resistance to extinction is
greater than when a prolonged con- 
tinuous reinforcement segment inter- 
venes between the intermittent rein- 
forcement experience and extinction, 
the hypothesis follows. 


METHOD 


Subjects.—The Ss that completed the 
study were 94 experimentally naive albino 
rats (51 males and 43 females) 53 to 94 days 
of age at the start of the study. One S was
discarded because of a programing error and a
second due to illness. All Ss were bred in our
laboratory.

Apparatus.—Two Grason-Stadler two-bar
Skinner boxes were used; the right bars were 
removed from both boxes, converting them 
into single-bar boxes. A force of approxi- 
mately 20 gm. was required to activate the 
bar microswitches in the two boxes. 

Design.—The experiment was run in two 
replications (of unequal N), but since the
results of the two replications were quite 
similar in form, replications was not included 
as a factor in the statistical design. 

There were four levels of acquisition,
defined in terms of the number of reinforced
responses allowed after pretraining: 200, 400,
800, and 1,600. At each acquisition level
there were two separate groups of Ss that
differed in the type of training received. The
“I” groups, designated as I(200), I(400),
I(800), and I(1600), underwent simple (nondiscriminative)
instrumental training, each
of the four groups being composed of 12 Ss.
The “D” groups, D(200), D(400), D(800),
and D(1600), were trained on a successive
brightness discrimination problem. There
were 11 Ss in the first two groups and 12 each
in the last two groups.

All Ss of a replication were quasirandomly 
assigned to the eight groups, in part balancing 
litters over groups, and placed on deprivation 
at the same time. The I and D groups of 
a given acquisition level started acquisition 
training together; the scheduling of the 
beginning of acquisition was so arranged for 
the various reinforcement groups that all 
groups entered extinction within a day or 
two of each other.

Deprivation training.—One week prior to 
the beginning of pretraining, Ss were placed 
on a 22-hr. food deprivation training regimen, 
water being constantly available in the home 
cages. Four days later, the feeding period 
was reduced to 1½ hr. daily and remained at
that duration throughout the study. 

Skinner box pretraining.—There were 3 pre- 
training days. On Days 1 and 2, 50 rein- 
forcements (45-mg. Noyes rat pellets) were 
given in the conditioning of approach re- 
sponses to the food tray at the sound of the 
feeder magazine. On Day 3, Ss were shaped 
on the bar pressing response with 25 to 50 
reinforcements. The stimulus conditions in
the boxes during pretraining were the same as
prevailed during simple instrumental training
and during the S^D portion of discrimination
training. 

Discrimination training.—All D groups
were exposed to a simple brightness discrimination
situation. Illumination of the
left (white jeweled) stimulus light constituted
S^D and provided an illumination level of 7 to
10 ft-c, measured with the target of a Weston
illumination meter placed at bar level and
? in. in front of the lens of the stimulus light.
Measured under the same conditions the
illumination under S^Δ, which was produced
by the shielded house light, read about
0.1 ft-c.

All S^D periods were 45 sec. in duration and
alternated with S^Δ periods varying between
33 and 65 sec. in length. One revolution of
the programing film tape took about 6 min.
and provided approximately equal total times
in S^D and S^Δ. Since 50 reinforced responses
in S^D were allowed each day, Groups D(200),
D(400), D(800), and D(1600) required 4, 8,
16, and 32 training days, respectively.

Simple instrumental training.—The treat- 
ment of the I groups was exactly the same as 
that accorded the corresponding D groups 
except for the elimination of the S^Δ periods,
i.e., the left stimulus light was always on. 
As in the D groups, 50 reinforcements were 
permitted on each acquisition day. 
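
The number of training days per group follows directly from the fixed allowance of 50 reinforcements per day. The sketch below works this out, and also shows one idealized way the groups could be staggered so that all finish acquisition on the same day; the stagger calculation is an illustrative assumption, since the text says only that groups entered extinction within a day or two of each other.

```python
REINFORCEMENTS_PER_DAY = 50
ACQUISITION_LEVELS = [200, 400, 800, 1600]


def training_days(total_reinforcements, per_day=REINFORCEMENTS_PER_DAY):
    """Number of daily sessions needed to reach an acquisition level."""
    days, remainder = divmod(total_reinforcements, per_day)
    if remainder:
        raise ValueError("acquisition level is not a whole number of sessions")
    return days


schedule = {level: training_days(level) for level in ACQUISITION_LEVELS}

# Hypothetical stagger: days to wait before starting acquisition so that
# every group finishes together with the longest-trained group.
longest = max(schedule.values())
start_delay = {level: longest - days for level, days in schedule.items()}

print(schedule)
print(start_delay)
```

The first dictionary reproduces the 4, 8, 16, and 32 training days given in the text for the four acquisition levels.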

At the end of the daily 50 reinforced responses,
Ss of all groups entered a time-out
period and were quickly removed from the
Skinner boxes to their home cages where, no
sooner than 15 min. later, they were fed for
the 1½-hr. period. The estimated average
number of hours of food deprivation at the
start of a day’s session was 22 hr.
Extinction.—The extinction procedure,
which began the day following the termination
of acquisition, was precisely the same for all
Ss and consisted of one daily 10-min. period
on each of 5 successive days with the left
stimulus light illuminated, i.e., in the former
S^D condition. The duration of the extinction
periods, 10 min., was judged to be short
enough to resemble closely the length of the
acquisition sessions (which overall averaged
7.15 min.) and yet long enough to sample
adequately the extinction process. It should
be pointed out, however, that because of the
absence of S^Δ periods, the extinction sessions
bore a greater similarity to the acquisition
sessions of simple instrumental training than
to those of discrimination training. The
exclusion from extinction of S^Δ periods (more
correctly, periods in which the former S^Δ
stimulus was present) was based on two
considerations: (a) We wished to make direct
comparisons of the extinction performance of
corresponding I and D groups, which would be
feasible only if the extinction sessions were
identical for all Ss. (b) Inclusion of such
periods would have raised problems concerning
the treatment of responses made in S^Δ,
since the number of such responses most
probably would be related to the acquisition
level variable.

It should be recorded that the cue provided
by activation of the feeder magazine was
maintained during extinction. Deprivation
level at the start of extinction sessions was
confined within the limits of 21

During the course of the experiment the
relative humidity ranged between 58% and
70%, and the temperature between 70
and 78° F.


RESULTS 


Discrimination acquisition.—A discrimination
ratio (DR), obtained by
dividing the number of responses
made in S^Δ by the number performed
in S^D, was calculated for each S after
every daily training session. In a plot
of the daily means of the DRs, Group
D(1600) showed some improvement
over the level of discrimination finally
achieved by Group D(800). To
evaluate this difference the mean DR
over the last 8 acquisition days was
calculated for every S of Group
D(1600), as was the mean DR over
the last two sessions for each S of
Group D(800). The group means
based on these measures (.141 and
.218, respectively) did not, however,
differ significantly (t = 1.67, df = 22,
P = .12).
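
The discrimination ratio can be related to the effective reinforcement schedule discussed in the introduction. The sketch below assumes that every response in the positive stimulus periods was reinforced and that no response in the negative stimulus periods was; under that assumption the share of all responses that were reinforced is 1 / (1 + DR). The function names and the response counts are hypothetical.

```python
def discrimination_ratio(negative_period_responses, positive_period_responses):
    """DR: responses in the negative stimulus periods divided by
    responses in the positive stimulus periods."""
    if positive_period_responses <= 0:
        raise ValueError("positive-period responses must be greater than zero")
    return negative_period_responses / positive_period_responses


def fraction_reinforced(dr):
    """Share of all responses that were reinforced.

    Assumption: every positive-period response is reinforced and no
    negative-period response is, so the share equals 1 / (1 + DR).
    """
    return 1.0 / (1.0 + dr)


# Hypothetical session: 50 reinforced responses and 11 unreinforced ones.
print(round(discrimination_ratio(11, 50), 2))

# Group mean DRs reported in the text.
for dr in (0.218, 0.141):
    print(dr, round(fraction_reinforced(dr), 3))
```

Smaller DR values move the implied schedule closer to continuous reinforcement, which is the direction of change the rationale attributes to prolonged discrimination training.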

Extinction in the I groups.—The 
number of responses made by each S 
in each of the five daily extinction 
sessions was converted to common 
logs, providing the basis for all 
statistical analyses. An overall index
of resistance to extinction was obtained
for each S by summing its
five daily log scores.
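
The index described above is a sum of common logarithms of the daily response counts. A minimal sketch follows; the counts are invented for illustration, and the handling of a session with zero responses is an assumption, since the text does not say how such a case was scored.

```python
import math


def resistance_index(daily_responses):
    """Sum of log10 of the daily extinction response counts.

    Assumption: every session has at least one response; the source
    does not say how a zero count would have been treated.
    """
    if any(count <= 0 for count in daily_responses):
        raise ValueError("log score is undefined for a zero response count")
    return sum(math.log10(count) for count in daily_responses)


# Hypothetical counts for one S over the five extinction sessions.
hypothetical_counts = [100, 100, 10, 10, 1]
print(resistance_index(hypothetical_counts))
```

Because the scores are summed logs, the index equals the log of the product of the daily counts, so a session with very few responses pulls the index down more than it would pull down a raw total.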

The first question of interest concerns
the relationship observed between
resistance to extinction and the
number of reinforced responses allowed
during acquisition. The solid
line in Fig. 1, based on the means of
the summed log scores, presents this
relationship. It is plain from the
figure that there is no tendency for
resistance to extinction to decrease
with increasing training, even when
acquisition is carried to 1,600 reinforced
responses; in fact, the solid line
suggests an increasing trend.

Fig. 1. Relationship between acquisition
level (number of reinforcements) and resistance
to extinction in the simple instrumental
(I) and the discrimination (D) groups.

The most powerful (and specific)
way of analyzing the present data is
by a trend analysis of the linear and
quadratic components of the curve
(Grant, 1956). The presence of a
significant quadratic component (with
negative sign) alone or in accompaniment
with a significant negative
linear component would constitute
evidence for a nonmonotonic relationship
between acquisition level and
resistance to extinction. A significant
negative linear component in conjunction
with an insignificant quadratic
would, essentially, support the
same interpretation. On the other
hand, an insignificant quadratic component
would support the assumption
of monotonicity if the linear component
were either insignificant or
positive in sign. The latter would
indicate that the curve was still rising,
while the occurrence of insignificance
in both the linear and quadratic components
would signify that the function
was asymptotic.
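
The interpretive rules just stated can be written out as a small decision function. This is only a restatement of the cases the text names; combinations the text does not mention (for example a significant positive quadratic component) are returned as not covered rather than guessed. The function name and argument names are illustrative.

```python
NOT_COVERED = "not covered by the stated rules"


def interpret_trend(linear_significant, linear_sign,
                    quadratic_significant, quadratic_sign):
    """Map trend-analysis outcomes onto the interpretations in the text.

    Signs are negative, zero, or positive numbers; a sign is only
    consulted when the corresponding component is significant.
    """
    linear_negative = linear_significant and linear_sign < 0
    linear_positive = linear_significant and linear_sign > 0

    if quadratic_significant:
        # Negative quadratic, alone or with a negative linear component.
        if quadratic_sign < 0 and (not linear_significant or linear_negative):
            return "nonmonotonic"
        return NOT_COVERED

    if linear_negative:
        return "nonmonotonic"
    if linear_positive:
        return "monotonic, still rising"
    if not linear_significant:
        return "monotonic, asymptotic"
    return NOT_COVERED


# Both components insignificant, as when neither F reaches significance.
print(interpret_trend(False, 0, False, 0))
```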

In order to take into consideration
the unequal spacing of the independent
variable, the coefficients of the
orthogonal polynomials were calculated
in the manner suggested by
Grandage (1958). Analysis of variance
showed both the linear and the
quadratic components of the trend to
be insignificant (F = 1.29 and 0.59,
respectively, df = 1/44). Thus, for
the numbers of rewarded responses
employed in this study, the function
relating resistance to extinction and
acquisition level in the nondiscriminatively
trained Ss was essentially
asymptotic.
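
Because the acquisition levels (200, 400, 800, and 1,600) are unequally spaced, the standard tabled polynomial coefficients for equal spacing do not apply. The sketch below derives linear and quadratic contrast coefficients for these levels by a generic Gram-Schmidt orthogonalization with equal weighting of the four groups; it is an illustration of the idea and not necessarily the exact procedure of Grandage (1958). Function names are illustrative.

```python
from fractions import Fraction
from functools import reduce
from math import gcd


def orthogonal_contrasts(levels):
    """Linear and quadratic contrasts for arbitrarily spaced levels.

    Uses exact fractions; assumes equal weight for every level.
    """
    x = [Fraction(v) for v in levels]
    n = len(x)
    mean_x = sum(x) / n
    linear = [v - mean_x for v in x]

    squares = [v * v for v in x]
    mean_sq = sum(squares) / n
    sq_centered = [s - mean_sq for s in squares]
    # Remove the part of the squared term that the linear term explains.
    slope = (sum(l * s for l, s in zip(linear, sq_centered))
             / sum(l * l for l in linear))
    quadratic = [s - slope * l for s, l in zip(sq_centered, linear)]
    return linear, quadratic


def to_smallest_integers(coeffs):
    """Rescale a contrast to the smallest equivalent set of integers."""
    lcm = reduce(lambda a, b: a * b // gcd(a, b),
                 (c.denominator for c in coeffs), 1)
    ints = [int(c * lcm) for c in coeffs]
    common = reduce(gcd, (abs(i) for i in ints))
    return [i // common for i in ints]


linear, quadratic = orthogonal_contrasts([200, 400, 800, 1600])
print(to_smallest_integers(linear))
print(to_smallest_integers(quadratic))
```

Each contrast sums to zero and the two are orthogonal to each other, so applying them to the four group means would give independent tests of the linear and quadratic components.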

Despite the absence of significant
differences in the summed log extinction
scores, one would like to
know whether extinction in the four I
groups followed a parallel development
over the five extinction sessions.
Figure 2 presents the extinction curves
of the four I groups over the 5
extinction days. A trend analysis
(Edwards, 1960) was applied to the
data and the differences among the
groups’ linear components evaluated.
The appropriate F (2.38, df = 3/176)
fell short of accepted significance
levels (P ≈ .07). The curves of Fig. 2
suggest that more impressive differences
among the linear components of
the groups’ trends, as well as among
their overall extinction scores, might
have been obtained if extinction had
been carried one or two sessions
further. At any rate, the data of the
figure plainly show that Group I(1600)
is not inferior to any of the other I
groups.

Fig. 2. Extinction curves of the four
simple instrumental groups, based on the log
of the number of daily extinction responses.

Extinction in the D groups.—The
first concern is again with the relationship
between acquisition level and
resistance to extinction, which, as
may be seen from Fig. 1, is vastly
different from that obtained with the
I groups. A trend analysis of the
curve revealed a significant linear
component (F = 4.13, df = 1/42,
P < .05), as well as a significant
quadratic component (F = 6.31,
df = 1/42, P < .025). Because of
the unequal Ns in the four discrimination
groups, the trend analysis was
based on group means, rather than
sums, in the manner suggested by
Walker and Lev (1953).

Figure 3 shows that the inferiority
of Group D(1600) relative to Groups
D(400) and D(800) is present throughout
extinction, though it appears most
marked on the last 2 extinction days.
Once again the differences among the
groups’ linear trends were evaluated
and once again the resulting F (2.51,
df = 3/168) was quite close to accepted
significance levels (P ≈ .06),
suggesting differences among the
slopes of the extinction curves.


Thus, in contrast to the I groups,
the discriminatively trained Ss, after
an initial increase, reveal a sharp
reduction in resistance to extinction
as a function of increasing acquisition
training. Further, the nonmonotonicity
appears in the early stages of
extinction, tending to be somewhat
more marked during the latter extinction
days.

Fig. 3. Extinction curves of the four
discrimination groups, based on the log of the
number of daily extinction responses.

Comparison of extinction in the I and
D groups.—If it is in fact true that
discrimination training, at least in its
early phases, provides an intermittent
reinforcement experience, then resistance
to extinction should be greater
in the combined D groups than in the
I groups. Analysis of the differences
between the means of the summed log
scores of the combined four D groups
(7.43) and the four I groups (6.96)
revealed that discrimination training
did indeed lead to significantly greater
resistance to extinction (t = 2.53,
df = 86, P < .02).

It will be observed in Fig. 1 that a
reversal in the extinction curves of the
I and D groups occurs in the 1600
groups; the reversal is, however, far
from significant (t = 1.25, P > .20).
Finally, tests of differences between
the means of the I and D groups
receiving 200, 400, and 800 reinforcements
yielded, in the same order, the
following results: t = 1.45, P > .10;
t = 2.25, P < .05; t = 2.63, P < .02.


DISCUSSION 


In general, our results support the hy- 
pothesis that the presence or absence of 
monotonicity in the function relating 
acquisition level to resistance to extinc- 
tion depends importantly on the type 
of learned response under consideration. 
In agreement with earlier reports (Harris 
& Nygaard, 1961; Margulies, 1961) 
monotonicity was found with a non- 
discriminative (free operant) response; 
with a discriminative response, on the
other hand, a marked nonmonotonicity
between acquisition level and resistance
to extinction was observed. While the
present results do little to explain the
nonmonotonicity obtained in the presumably
nondiscriminative situations of
North and Stimmel (1960) and King,
Wood, and Butcher (1961), they perhaps
shed some light on other, related, studies
in which a strong discriminative component
was present (e.g., Murillo &
Capaldi, 1961; Senko et al., 1961).
Conceivably, they also possess some relevance
for the overlearning reversal effect
(the faster reversal learning of Ss
receiving extensive overtraining), inasmuch
as that phenomenon has been
attributed by some to a nonmonotonic
relationship between acquisition level
and resistance to extinction of the approach
response to the originally positive
stimulus (e.g., Birch et al., 1960).

Turning now to a possible mechanism
by which the nonmonotonicity of the
discrimination groups might be explained,
two separate factors seem to be
involved. First, it probably is safe to
assume that the intermittent reinforcement
experience unavoidably associated
with the early phases of discrimination
training has the effect of augmenting
resistance to extinction in Ss receiving
moderate amounts of discrimination
training. This assumption is supported
in the present study by the superior
resistance to extinction of the combined
discrimination groups, with further verification
coming from an earlier study
by Jenkins (1961a), who worked with
pigeons in a discrete trials situation.
Second, it is still necessary to explain 
how resistance to extinction becomes 
depressed with overtraining, and two 
possibilities suggest themselves. The 
first one is that mentioned earlier, 
namely, that overtraining has the effect 
of carrying Ss beyond their intermittent 
reinforcement experience well into a 
region of virtual continuous reinforce- 
ment. However, there are the following 
difficulties with this possibility. (a) Sev- 
eral studies (e.g., Jenkins, 1961b; Theios, 
1962) have failed to demonstrate a 
reduction in resistance to extinction as a 
result of interpolating a continuous rein- 
forcement segment between a partial 
reinforcement experience and subsequent 
extinction. (b) There is evidence that 
nonmonotonicity between acquisition 
level and resistance to extinction can 
occur in situations where responding to 
the negative stimulus is under the con- 
trol of E rather than S, as in successive 
discrimination training on a straightaway
(Birch et al., 1960). (c) Finally,
the nature of the present argument is
such that, in principle, no amount of
discrimination training could reduce
resistance to extinction below that of
a comparable nondiscriminative group.
The results obtained with the 1,600
groups of the present study suggest that
such a reversal is a distinct possibility
if acquisition were carried somewhat
further.

An alternative interpretation of the 
basis of the nonmonotonicity maintains 
that the discrimination experience is 
vital completely apart from attending 
changes in the effective reinforcement 
schedules (cf. Murillo & Capaldi, 
1961, who found nonmonotonicity 
only in those Ss that had learned the 
discrimination presented them during 
acquisition). This is a position, however,
that is difficult to specify with any degree
of precision. Nevertheless, it is suggestive
that in Group D(1600) goodness
of discrimination (the reciprocal of the
mean of the DRs over the last 10
acquisition days) was negatively correlated
with resistance to extinction
(summed log R), with rho equal to .53
(.05 < P < .10).


SUMMARY 


This experiment investigated the hy- 
pothesis that acquisition level and resistance 
to extinction would be monotonically related 
for a simple (nondiscriminative) instrumental 
response (bar pressing in a Skinner box), but 
the function would be nonmonotonic for a 
comparable discriminative response (successive 
brightness discrimination). Four groups of 
Ss were trained on the simple instrumental 
response and allowed 200, 400, 800, or 1,600 
reinforced responses, 50 per day. The same 
numbers of reinforced responses were given 
to four corresponding groups of discrimina- 
tively trained Ss, the procedure employed 
with the latter differing only in the insertion 
of occasional S* periods. All groups were 
exposed to one 10-min. extinction period (in 
the former S*) on each of 5 successive days.
Trend analyses of the data supported the
initiating hypothesis; and as expected, the 
discriminatively trained Ss were, as a group, 
more resistant to extinction than the Ss 
trained on the simple instrumental response.


T MAZE REVERSAL LEARNING AFTER SEVERAL
DIFFERENT OVERTRAINING PROCEDURES 1

WINFRED F. HILL, NORMAN E. SPEAR,
Northwestern University

AND K. N. CLAYTON
Vanderbilt University

1 This research was supported by Grant
G-8706 from the National Science Foundation
and was conducted at Northwestern
University.

Several investigators (Brookshire,
Warren, & Ball, 1961; Capaldi &
Stevenson, 1957; North & Clayton,
1959; Pubols, 1956; Reid, 1953) have
demonstrated that overlearning of a
discrimination facilitates its subsequent
reversal. This effect at first
seems paradoxical, since it appears to
imply that increasing the number of
reinforced trials to one cue weakens
the tendency to respond to that cue.

Several mechanisms have been suggested
which might explain this
overlearning-reversal effect (ORE). (a)
The additional practice in making
choices during the overtraining may
facilitate subsequent reversals. This
might be mediated by acquired observing
responses, as suggested by
Reid (1953) and Pubols (1956).
(b) The long series of rewards may
make a change more discriminable
and hence make reversal learning
faster, a suggestion made by Capaldi
and Stevenson (1957). (c) The
greater number of rewards, by building
up stronger rG's, may result in
greater frustration and hence greater
disruption when reward ceases to
follow the accustomed response. This
is the explanation offered by North
and Stimmel (1960) for their finding
that overlearning facilitated extinction
in a straight alley. It is also
consistent with Birch, Ison, and
Sperling's (1960) finding that with
single stimulus presentation the over- 
learning-reversal effect is mainly at- 
tributable to the rate of extinction 
of the old response. (d) The long 
series of trials on which nearly every 
response is rewarded may reduce, 
through any of various mechanisms, 
the tendency to avoid the incorrect 
cues, thus making it easier to approach 
those cues when they become correct. 
This explanation is suggested by 
D’Amato and Jagoda (1961). (e) So 
much stimulus satiation may be built 
up to the correct cues that there is a 
tendency to avoid these cues as soon 
as they cease to be associated with 
reward. Though this has not been
suggested as an explanation for the
ORE, it seems consistent with research
summarized by Glanzer (1958).

The present study attempted to
provide evidence concerning the relative
importance of these various
mechanisms for the ORE. Four
groups were compared. Two were the
groups found in any study of the
ORE: Group N reversed as soon as
the discrimination was learned, Group
Fr given overtraining. The other two
groups were also given overtraining,
but with special features. Group Co
was forced to the correct side on all
its overtraining trials, thus being
deprived of the opportunity for making
choices. Group In was given
twice as many overtraining trials as
Groups Fr and Co, with half forced
to the correct and half to the incorrect
side. This group was thus
given more experience with the incorrect
side than any of the others.
It was predicted that Group Fr would
reverse fastest and either Group N 
or In slowest, with the exact positions 
of the groups throwing light on the 
various explanations of the ORE. 


EXPERIMENT I
Method 


Subjects and apparatus.—The Ss were 64 
experimentally naive female albino rats of the 
Sprague-Dawley strain, 74 to 75 days old 
at the beginning of experimental training. 
One S was discarded because of apparatus 
failure and was replaced by another S of the 
same description. The apparatus was the 
narrow T maze described by Cotton, Lewis, 
and Metzger (1958). This is an enclosed T 
maze with a 4-ft. stem and 9-in. arms (exclud- 
ing goal box). The wooden doors beyond the 
choice-point, utilized to prevent retracing, 
were also used for forcing purposes in this 
experiment. 

Prehandling.—Each S received six daily 
3-min. sessions of prehandling, the last one 
48 hr. prior to the beginning of experimental 
training. During each session S was allowed 
to explore a large unpainted wooden box, 
presented with four of the pellets later to 
serve as reward, and picked up and replaced 
at least five times by E. Pellets uneaten 
were returned with S to the home cage along 
with a fifth pellet. A once-daily feeding 
schedule began on the first day of prehandling 
and was maintained throughout experimental 
training. The ration was 10 gm. of finely 
ground Purina lab chow and was presented 
30-35 min. after the start of prehandling or 
experimental training.

Experimental training.—On each trial, 
after placement into the start box, the door 
was opened as soon as S was oriented toward 
it. Located just outside the start box was a 
treadle which, when S stepped on it, started 
two 1/100 sec. clocks. The first clock stopped 
automatically when S stepped on a treadle 
located just before the choice-point. The 
second clock stopped when S stepped on a 
treadle just beyond the choice-point in either 
arm of the maze. The side designated as 
correct for a particular S was baited with 
two 45-mg. Noyes pellets presented in a 
slightly bent tin lid. No lid was present on 
the nonrewarded side. For half the Ss, the 
right side was designated correct; for the
others the left was correct. The noncorrection
procedure was used throughout, and the 
duration of goal-box confinement was ap- 
proximately 10 sec. regardless of choice. 
Between trials S was kept for 15 sec. in a 
carrying cage before being placed in the start 
box for the next trial. 

General design.—There were three stages 
to the experimental training—acquisition, 
overtraining, and reversal. The experimental 
variable was manipulated only in the over-
training stage. However, prior to the
beginning of acquisition, Ss were randomly
assigned to four experimental groups of 16 Ss
each. The three stages of training required a 
total of 13 days for all groups. 

Acquisition.— Twelve free trials were given 
on each of 3 consecutive days. Groups Fr, 
Co, and In received acquisition on Days 1, 2, 
and 3 of acquisition. Group N, however, 
received acquisition on Days 1, 2, and 11.

Overtraining.—On Days 3-6 and 8-10 
Group N was treated exactly as it had been 
during prehandling and was given no T maze 
experience during this time. It thus served 
as a no-overtraining control group. Group Fr 
was given 15 free trials on each of Days 4-6 
and 8-10 and 12 free trials on Day 11, making 
a total of 102 free overtraining trials. Group 
Co was given 15 trials, all forced to the correct 
side, on each of Days 4-6 and 8-10, and 12 
trials, also forced correct, on Day 11. This 
made a total of 102 forced-correct overtraining 
trials. Group In was given 30 trials on each 
of Days 4-6 and 8-10 and 24 trials on Day 11.
All of these 204 trials were forced, half to the
correct side and half to the incorrect. The
distribution of correct and incorrect forced
trials was determined randomly, and was
different for each S. On Day 7, all Ss in all 
groups were fed on schedule but were not 
handled. 
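
The overtraining totals given above (102, 102, and 204 trials) follow from the daily schedule. As an arithmetic check, the short Python sketch below recomputes them. The dictionary layout and function name are illustrative only and are not part of the original report; only the trial counts are taken from the text.

```python
# Overtraining schedule for Exp. I as described in the text:
# six days (Days 4-6 and 8-10) at one daily rate, plus Day 11 at another.
# Names are hypothetical; only the numbers come from the report.
SCHEDULE = {
    "Fr": {"per_day": 15, "days": 6, "day_11": 12},  # free trials
    "Co": {"per_day": 15, "days": 6, "day_11": 12},  # all forced correct
    "In": {"per_day": 30, "days": 6, "day_11": 24},  # forced, half to each side
}


def total_overtraining_trials(group):
    """Sum the six regular overtraining days and the Day 11 session."""
    s = SCHEDULE[group]
    return s["per_day"] * s["days"] + s["day_11"]


totals = {group: total_overtraining_trials(group) for group in SCHEDULE}
print(totals)
```

Group N is omitted because it received no T maze experience during this stage.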

Reversal.—All Ss received 15 free trials on 
each of Days 12 and 13. During this stage, 
reward was placed in the goal box opposite to 
that which had originally held the reward 
during acquisition and overtraining, so that 
the formerly correct goal box became in- 
correct and vice versa. 


Results 


Acquisition.—The mean proportion
of correct choices on the last five trials
was .93. An analysis of variance on
the latter half of acquisition revealed
no significant differences among the
groups (F = 1.87). In Group N
there was no evidence of forgetting
over the interval between the second
and third days of training.

Overtraining.—Group Fr made a
total of 3.3% incorrect responses
during overtraining. The median
number of correct responses after the
last error was 54.

Reversal.—The course of reversal
learning for the four groups is shown
in Fig. 1. The mean number of correct
choices on the 30 reversal trials is
shown for the four groups in Table 1.
The overall F for the four groups was
37.01, significant at the .001 level.
Adjacent groups were compared with
t tests. Groups N and Co did not
differ significantly. However,
Group Co was superior to Group
Fr at the .01 level (t = 2.96), and
Group Fr in turn was superior to
Group In at the .001 level (t = 6.02).


TABLE 1

TOTAL CORRECT CHOICES IN REVERSAL

         Experiment I   Experiment II    Experiment III
Group    Mean           Mean    SD       Mean    SD
N        21.88          20.31   5.69     18.15   7.40
Fr       18.94          16.69   5.42      9.07   9.13
Co       21.31          21.38   4.50
In       10.75           7.06   6.10


The mean number of incorrect
responses before the first correct
reversal response is shown for the four
groups in Table 2. All differences
were significant at the .01 level except
that between Groups N and Co,
which did not approach significance.


TABLE 2

NUMBER OF ERRORS BEFORE FIRST CORRECT
RESPONSE IN REVERSAL

Experiment I | Experiment II | Experiment III
Group    Mean   Mean   SD   Mean
N         6.44 | 5.75
Fr        9.31 | 5.93
Co        5.25 | 3.77
In       16.19 | 8.90


In view of the failure to find an
overlearning-reversal effect in terms
of total correct responses in reversal,
the reversal data were also analyzed
in terms of a criterion of 18 correct
choices in 20 successive trials, with not
more than one error in the last 10
of the 20. This is the same as Pubols'
(1956) criterion except that his 20-
trial units always involved 2 complete
days of running, whereas ours could
begin on any trial. Comparison with
Pubols' second experiment is most
appropriate since it was closest to ours
in the kind of discrimination involved.
The numbers of Ss reaching this
criterion in the four groups were 10,
7, 10, and 1, respectively. These
values yield a χ² of 13.7, significant
at the .01 level for 3 df.
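
The reported χ² can be checked from the criterion counts. The report does not state how the statistic was computed, so the sketch below assumes an ordinary Pearson chi-square on a 4 × 2 table (Ss reaching vs. not reaching criterion) with 16 Ss per group. The function name is illustrative.

```python
def chi_square_k_by_2(reached, group_sizes):
    """Pearson chi-square for a k x 2 table (reached / did not reach).

    Assumption: expected counts come from the pooled proportion of Ss
    reaching criterion; the original report does not spell this out.
    """
    total_n = sum(group_sizes)
    total_reached = sum(reached)
    stat = 0.0
    for r, n in zip(reached, group_sizes):
        cells = ((r, total_reached), (n - r, total_n - total_reached))
        for observed, column_total in cells:
            expected = n * column_total / total_n
            stat += (observed - expected) ** 2 / expected
    return stat


# Exp. I: Ss reaching criterion in Groups N, Fr, Co, and In, 16 Ss each.
exp1 = chi_square_k_by_2([10, 7, 10, 1], [16, 16, 16, 16])
print(round(exp1, 1))  # 13.7
```

With four groups and two outcome categories the statistic has (4 - 1)(2 - 1) = 3 df, matching the text.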

Speeds.—Speeds in feet per second
on the first five trials of reversal are
shown in Table 3. Stem speeds are
from the starting treadle to the
treadle just before the choice-point;
total speeds are from the starting
treadle to one of the two treadles just
beyond the choice-point. For stem
speeds there were no significant differences
among the groups. The overall
F was 1.29, and the t ratio between the
two extreme groups (1 and 2) was
1.79. For total speeds, however, the
overall F was 4.16, significant at the
.01 level. Of the differences in total
speed among adjacent groups, only
that between Groups Fr and In
reached the .05 level of significance
by t test.


TABLE 3

MEAN SPEEDS IN FEET PER SECOND ON FIRST
FIVE REVERSAL TRIALS

Experiment I | Experiment II | Experiment III
Stem  Total | Stem  Total | Stem  Total

2.86
3.23
2.70 | 3.87
3.50 | 4.36
3.03 | 4.45 | 3.
3.10 | 3.44


EXPERIMENT II 


The failure to find an ORE in Exp. I
brought the purpose of the study into
question. An attempt was therefore
made to change the conditions so as
to replicate the earlier findings of an
ORE. In order to bring Exp. II
closer to these earlier studies, the
number of overlearning trials was
increased, the intertrial interval was
made longer, and various other details
were changed.


Method


Subjects and apparatus.—The Ss were 64 
experimentally naive female albino rats of the 
Sprague-Dawley strain, 74 to 78 days old
at the beginning of experimental training,
divided into four groups of 16 as in Exp. I.
Two Ss were discarded because of apparatus
failure and 5 were discarded because on three
consecutive trials they failed to trip the first
treadle within the 2-min. time limit or failed
to trip one of the second treadles within the
3-min. limit. (On such trials, S was placed
in a randomly selected goal box on the first
occasion and subsequent odd-numbered occasions,
and in the alternative goal box on all
even-numbered occasions. The time score
was recorded as 180 sec. and the goal box
into which S was placed was recorded as
chosen.) These discarded Ss were replaced
by Ss of the same description. The apparatus
was the same as that used in Exp. I.

Prehandling.—Prehandling was the same
as in Exp. I, but the feeding schedule was
changed slightly. The once-daily ration of
10 gm. was presented 2 hr. after the start of
prehandling or experimental training, since
the altered procedure increased the total time
of experimental training each day from 15-25
min. to 90-110 min.

Experimental training.—Except for a
change in intertrial interval and magnitude of
reward, all details were the same as in Exp. I.
In Exp. II, 3-4 min. elapsed between the
placement of S in its carrying cage following
a trial and its removal at the beginning of the
subsequent trial. In addition, only one 45-mg.
pellet was present in the proper goal
box on each trial. These changes applied to
acquisition, overtraining, and reversal.

Acquisition.—Fifteen free trials were given
on each of 2 consecutive days. Groups Fr, 
Co, and In all received acquisition on Days 1 
and 2 of experimental training. Group N 
was prehandled (in the same manner as 
before) on Days 1-9 and then given acquisi- 
tion on Days 10-11. 

Overtraining.—Group Fr was given 20 free 
trials on each of Days 3-6 and 8-9 and 15 
free trials on each of Days 10-11, for a total 
of 150 free overtraining trials. Group Co 
was given 20 trials, all forced to the correct 
side, on each of Days 3-6 and 8-9 and 15 
trials, also forced correct, on Days 10-11, for 
a total of 150 forced correct trials. Group In 
was given 40 trials on each of Days 3-6 and 
8-9 and 30 trials on Days 10-11. Half of 
these 300 trials were forced correct and half 
forced incorrect. 

Reversal.—On Days 12 and 13 all Ss were
given reversal training as in Exp. I.


Results

Acquisition.—The mean proportion
of correct choices on the last five
trials of acquisition was .93. An
analysis of variance of the last half
of acquisition revealed no significant
difference among the groups (F < 1).

To check comparability with one
previous study using spatial discrimination,
it was determined for each S
on what trial, if at all, it reached the
criterion defined in Exp. I. For the
four groups combined, 35 of the 64 Ss
reached this criterion, the median for
completion of the criterion being
Trial 29. This means that by Pubols'
definition the median S received one
trial of overtraining during acquisition.

Overtraining.—Group Fr made a
total of 2.7% incorrect choices
during overtraining. The median
number of correct responses after the
last error was 100.

Reversal.—The course of reversal
learning for the four groups is shown
in Fig. 1, and the mean number of
correct choices for all 30 trials is
shown in Table 1. The pattern of
group differences is clearly similar to
that found in Exp. I. Analysis of
variance of the four group means
yielded an F of 22.77, significant at the
.001 level. By t test, Group In was
found to be lower than each of the
others at the .001 level, while Group
Fr was lower than Group Co at the
.05 level. Other differences were not
significant.

The mean number of errors before
the first reversal is shown in Table 2.
Group N does not differ significantly
from Groups Fr and Co, but all other
differences are significant at at least
the .05 level.

The reversal data were also analyzed
according to the same criterion
as in Exp. I. The numbers of Ss
reaching criterion in the four groups
were 8, 7, 14, and 1, respectively,
yielding a χ² of 21.3, significant at the
.001 level for 3 df.

FIG. 1. Reversal learning curves in the three experiments.
Speeds.—Speeds on the first five 
reversal trials are shown in Table 3.
The overall difference among the
groups is significant at the .01 level
both for stem speeds (F = 7.27) and
for total speeds (F = 5.95). In both
cases the two higher groups differ
significantly from the two lower, but
Group Fr does not differ significantly
from Co or Group N from In.


EXPERIMENT III

Since the pattern of results in Exp.
II did not differ appreciably from that
in Exp. I, a further attempt was made
to find an ORE. Since Ss in Exp.
I and II were younger than those in
some of the previous studies that
found an ORE, older Ss were used in
Exp. III. Only Groups N and Fr
were included in this experiment.


Method


Subjects and apparatus.—The Ss were 27 
experimentally naive female albino rats of the 
Sprague-Dawley strain, 120-121 days of age 
at the beginning of experimental training. 
Three Ss were discarded because on three
consecutive trials they failed to trip the
treadles within the specified time limits,
which were the same as in Exp. II. These
discarded Ss were replaced by other Ss of the
same description.

Procedure and design.—The Ss were randomly
assigned to one of two groups which
were treated exactly like Groups N and Fr
of Exp. II. There were 13 Ss in Group N and
14 Ss in Group Fr.


Results 


Acquisition.—Mean proportion of
correct choices on the last five trials
of acquisition was .94 for Group N
and .84 for Group Fr. A t test on the
number of correct responses in the
latter half of acquisition revealed that
Group N was significantly superior
at the .02 level (t = 2.63). No reason
for this difference is apparent. The
median S in the two groups combined
just reached criterion, with no additional
trials.

Overtraining.—Group Fr had a
mean of 2.0% incorrect choices during
overtraining. The median number of
correct responses after the last error
was 102.

Reversal.—The course of reversal
learning is shown in Fig. 1. The total
numbers of correct responses in reversal
are given in Table 1. The
superiority of Group N is significant
at the .01 level (t = 2.89). In view
of the superiority of Group N in
acquisition, this might be attributed
to a chance superiority in learning
ability of Ss in the group. However,
the within-groups correlation between
number of correct choices in the latter
half of acquisition and number correct
during reversal is negative and nonsignificant
(r = −.22), which makes
such an interpretation implausible.


Mean errors before the first reversal 
are shown in Table 2. The difference 
between the groups is significant at 
the .01 level. The reversal criterion 
previously described was reached by 6 
Ss in Group N and 3 in Group Fr, 
yielding a nonsignificant χ² of .91.

Speeds.—Stem speeds and total
speeds for the two groups are shown in 
Table 3. As in both previous experi- 
ments, Group Fr was faster than 
Group N on both measures. The 
differences were not significant, how- 
ever (t’s = .85 and 1.20, respectively). 


DISCUSSION 


Reversal learning was found to be 
temporarily retarded by free-trial over- 
training, greatly retarded by forced-trial 
overtraining when half the trials were 
to each side, and unaffected by forced- 
trial overtraining when all the trials 
were to the correct side. These effects 
were primarily accounted for by the 
duration of perseveration on the formerly 
correct side in the different groups at the 
beginning of reversal, as may be seen by 
comparing Tables 2 and 3. 

The most noteworthy aspect of these 
findings is the failure to confirm the 
finding of other investigators that over- 
learning facilitates reversal. We seem 
to have ruled out the possibility that 
the discrepancy between our results and 
those of previous Es is due to age of Ss, 
number of overlearning trials, or inter- 
trial interval. Four differences remain 
between our procedure and earlier ones. 

1. We used a constant number of
acquisition trials before overtraining
instead of carrying all Ss to a criterion.
Because of the variability of the Ss, a
training procedure which gives a constant
number of trials may fail to insure
that all Ss will master the original task
or may permit a few to receive a degree
of overtraining. Either of these conditions
might facilitate reversal performance
of Group N and reduce the chance
of demonstrating an ORE. For this
reason comparisons were made among
three subgroups of Groups N and Fr:
(a) those that failed to reach the acquisition
criterion, (b) those that had 5 or
fewer acquisition trials after reaching 
criterion, and (c) those that had 6 to 10 
(the maximum possible) acquisition trials 
beyond criterion. In all three subgroups 
in all three experiments, Group N made 
more correct reversal responses than 
Group Fr. (In view of the small N in 
these subgroups, no statistical tests were 
made.) This suggests that the failure 
to use an acquisition criterion was not 
critical. There might be an interaction, 
however, between this consideration and 
the next one. 

2. We gave a constant number of
reversal trials rather than carrying each
S to a reversal criterion. Perhaps Ss in
Group Fr, once they overcame their
initial disadvantage, would have surpassed
Group N in reaching criterion.
Neither the reversal learning curves nor
the number of Ss reaching criterion in the
two groups lend any support to this
view. As for the possible interaction of
Differences 1 and 2, of the Ss in all three
experiments that reached the acquisition
criterion with five or fewer trials to spare,
5 in Group N and 4 in Group Fr reached
the reversal criterion.

3. We tried to control differences in
amount of handling between overtrained
and nonovertrained groups by giving
Group N extra handling when they were
not being run. This may have controlled
away the whole effect. If this
should be the explanation, it would make
the overlearning-reversal effect a more
trivial phenomenon than has been
suspected.

4. Our task was a simple spatial
discrimination with no irrelevant visual
cues (such as Pubols had) and no
separation of place and response cues (as
with Brookshire et al.). If observing
responses are the crucial factor in the
overlearning-reversal effect, such a simple
discrimination may be too easy for such
responses to be important. This explanation
implies that our task is easier
than those in which the effect has been
found. In terms of acquisition rate, this
appears to be true of most of the other
studies, though not of Pubols' Exp. 2.
If this exception can be explained by the
extensive pretraining in Pubols’ experi- 
ment leading to faster learning, then the 
simplicity of our discrimination might 
well be the crucial factor in our failure 
to find an overlearning-reversal effect. 
This would be consistent with D'Amato 
and Jagoda’s (1962) failure to find an 
overlearning-reversal effect in a spatial 
discrimination. Their study, of which 
we were unaware when conducting ours, 
is to our knowledge the first published 
failure to find an overlearning-reversal 
effect with high overlearning. The 
situation is complicated, however, by 
an unpublished study of Erlebacher 
(1961). He failed to find the over- 
learning-reversal effect, even though he 
used a brightness discrimination. The 
necessary conditions for obtaining an 
overlearning-reversal effect thus remain 
in doubt. 

Our results and the strikingly similar 
findings of D’Amato and Jagoda seem 
to be clearly incompatible with the first 
three interpretations of discrimination 
reversal listed in the introduction. These 
findings can, however, be _ reconciled 
with Interpretations 4 and 5, both of 
which are concerned with avoidance of 
the formerly incorrect side. Either of 
these interpretations is consistent with 
the rank order of our three overtraining 
groups. The unexpected superiority of 
Group N in reversal could then be ex- 
plained by their weaker tendency to 
approach the formerly correct side, a
tendency supported by the generally
slower speeds of Group N. The similar
patterns of stem speed and total speed 
suggest that time spent in the stem 
rather than time spent in the choice area 
or arms was the major source of variance 
in both speed measures. 


SUMMARY 


Experiment I attempted to find the cause
of the overlearning-reversal effect by comparing
T maze reversal learning by four
groups of rats that received different patterns
of overtraining in acquisition. Reversal was
fastest for the group receiving no overtraining
and the group receiving all its overtraining
trials forced to the correct side. Free-choice
overtraining gave somewhat slower reversal,
and overtraining with an equal number of
forced trials to the two sides gave much slower
reversal.

In view of the failure to replicate previous
findings of faster reversal after overtraining,
two further experiments were run in an
attempt to replicate these earlier findings.
Both experiments gave the same pattern of
results as Exp. I; no overlearning-reversal
effect was found. These results appear to be
consistent with interpretations in terms of
stimulus satiation or of avoidance of
nonrewarded cues, but not with interpretations
in terms of observing responses,
discriminability, or frustration.


MONETARY INCENTIVE AND RANGE OF PAYOFFS
AS DETERMINERS OF RISK TAKING 1

LEONARD KATZ

University of Massachusetts


Myers and associates (Myers &
Kort, 1961; Myers & Katz, 1962;
Myers & Sadler, 1960) have been
concerned with the effects of parameters
of the payoff distribution
upon the choice between gambling
and not gambling. Myers and Sadler
(1960) varied the number of chips
which might be won or lost on each
gamble (range), the average payoff
being zero. When the alternative to
gambling was the sure gain of one
chip, gambling increased with increases
in range; when the alternative
to gambling was the sure loss of one
chip, gambling decreased with an
increase in range.

1 This research was supported by National
Science Foundation Grant G-11380 and
National Institute of Mental Health Grant
M-3803, and was part of a study submitted
in partial fulfillment of the Master of Science
degree at the University of Massachusetts.
The author is indebted to Jerome L. Myers,
who served as thesis adviser.

In the present experiment chips
worth no money and chips worth 5¢
were used to provide data on effects
of incentive value. Monetary incentive
and range of payoffs are similar
to each other in that an increase in
either increases the risk associated
with each gamble. The implied hypothesis
is that increased incentive,
like increased range, may result in
more gambling, where the alternative
to gambling is the sure gain of one
chip, and in less gambling when the
alternative to gambling is the sure
loss of one chip. The objective of the
present study was, therefore, to
obtain data to test this hypothesis.
The S's decision to gamble under
different payoff ranges was followed
by the loss or gain of chips which had 
no monetary value or was followed by 
the loss or gain of chips worth 5¢. 

In addition to providing data on the 
effects of monetary incentive, the 
present study investigated the effects 
of a payoff range greater than that 
used in the Myers and Sadler (1960) 
study. To control for any range 
effects due to differences among 
ranges in sequences of payoffs, one 
sequence was randomly generated and 
the other two were derived from it in a 
manner described in the procedure 
section. 

METHOD 


Materials.—Four decks of 100 3 K 4 
white cards were prepared. The known 
payoff deck contained 50 cards with +1 
written on them, alternated randomly with 
50 cards with —1. The other decks, of 100 
cards each, provided for three different ranges 
of unknown payoffs. The narrow-range deck
(N) had integers randomly chosen from +2
to +6 and from −2 to −6. The medium-
range deck (M) was constructed by adding
10 to every positive number of Deck N and
subtracting 10 from every negative number,
giving a range of +12 to +16 and −12 to
−16, retaining the same ordinal positioning of
cards as found in Deck N. The wide range 
deck (W), was constructed by adding and 
subtracting 20 to the integers of Deck N. 
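
The derivation of Decks M and W from Deck N can be sketched in a few lines of Python. This is an illustration only: the report does not say how Deck N was randomized or how many positive and negative cards it held, so the equal split, the seed, and the function names below are assumptions. The shift rule (10 for Deck M, 20 for Deck W, applied away from zero with card order preserved) is taken from the text.

```python
import random


def make_narrow_deck(n_cards=100, seed=0):
    """Deck N: integers drawn at random from +2..+6 and -2..-6.

    Assumption: half the cards are positive and half negative; the
    report does not state how signs were balanced.
    """
    rng = random.Random(seed)
    half = n_cards // 2
    cards = [rng.randint(2, 6) for _ in range(half)]
    cards += [-rng.randint(2, 6) for _ in range(n_cards - half)]
    rng.shuffle(cards)
    return cards


def widen(deck, shift):
    """Move every card `shift` units away from zero, keeping card order."""
    return [card + shift if card > 0 else card - shift for card in deck]


deck_n = make_narrow_deck()
deck_m = widen(deck_n, 10)  # +12..+16 and -12..-16
deck_w = widen(deck_n, 20)  # +22..+26 and -22..-26
```

Because each card keeps its sign and its position, the three decks present the same sequence of wins and losses and differ only in range, which is the control described in the introduction.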

Procedure.—On each of 3 successive days S 
was presented with the known payoff deck, 
and a different one of the three unknown 
payoff decks. Order of use of Decks N, M, 
and W was counterbalanced in a 3 X 3 Latin 
square with 6 Ss given each order. Eighteen 

Ss gambled only for poker chips (0¢) and 18 
others gambled for chips worth 5¢ apiece 
The main features of the experimental design 
are shown on the left-hand side of Table 1 

The Ss were tested individually, being given full instructions at the beginning of
Session 1. For Sessions 2 and 3 Ss were told only that the unknown payoff deck of that
day was a new one. Two hundred poker chips were stacked in front of S, and Ss with
a monetary incentive (5¢) were told that each chip was worth 5¢; i.e., their initial
stake was worth $10.00.

Briefly, Ss were instructed to turn over the top card in the deck of known payoffs
at the beginning of each trial. They then chose between standing pat by accepting the
gain or loss of one chip represented by the card and gambling by drawing the top card
from the deck of unknown payoffs. If S decided not to gamble, the top card in the
unknown payoff deck was turned anyway, showing what he would have won or lost had he
gambled.

Ratings.—At the end of each session, Ss were given an 11-point scale from −5 to
+5, along which they rated the means of the known payoff deck and of the unknown
payoff deck used that session. They were also asked to describe their gambling strategies
and changes in strategy. After Session 3, Ss were asked what they would have done
differently in Sessions 1 and 2.

Subjects.—The Ss were 36 male undergraduates enrolled in the university summer
session, who were divided randomly into two groups of 18 Ss each. Each S was paid $3.00.

TABLE 1
MEANS OF PROPORTIONS OF GAMBLING RESPONSES ON +1 AND −1 TRIALS
FOR SUCCESSIVE 25-TRIAL BLOCKS
(Column headings: Incentive, Range, Alternative to Gambling. The table body is not legible in the source.)

RESULTS

Choices.—The scores were proportions of choices to gamble on both +1 and −1 trials
during each of four 25-trial blocks. For example, if in a particular block S gambled
on 3 of the 12 (or 13) trials in which the alternative to gambling was +1, his +1
proportion for that block was .250 (or .231). Table 1 presents means of these
proportions for each combination of incentive and range. An analysis of variance was
performed on arc-sine transforms of the proportions. The results of this analysis
(shown in Table 2), in conjunction with the relationships shown in Table 1, suggest
the following conclusions: (a) under all combinations of range and incentive, more
risk-taking occurs on −1 trials than on +1 trials (P < .001); (b) the difference in
incentives had little effect on total number of risks taken; (c) more gambling occurred
with Deck M than with Deck N on +1 trials and the reverse on −1 trials (P < .01); and
(d) value, incentive, and range have a joint effect on gambling (P < .001). A further
breakdown suggests that this effect is due largely to differences in the quadratic
curvature of the V-I curves over the three ranges. For +1 trials, the curves for
incentive are essentially mirror images of each other; for −1 trials, the curves for
incentive are essentially parallel to each other.
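The scoring rule and the transform can be checked with a short sketch. The text does not state which arc-sine variant was applied, so the common form 2·arcsin(√p) used below is an assumption; function names are illustrative.

```python
import math


def gamble_proportion(gambles, trials):
    """Proportion of trials in a block on which S chose to gamble."""
    return gambles / trials


def arcsine_transform(p):
    """Variance-stabilizing transform for a proportion (assumed variant)."""
    return 2.0 * math.asin(math.sqrt(p))


p_12 = gamble_proportion(3, 12)  # block with 12 trials of the +1 alternative
p_13 = gamble_proportion(3, 13)  # block with 13 trials of the +1 alternative
score = arcsine_transform(p_12)
```

Rounded to three places, the two proportions reproduce the .250 and .231 quoted in the text.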


In addition, there were a number 
of significant interactions involving 
blocks of trials. These effects are 
probably due to the interaction of 
range, incentive, and value with the 
effects of both the temporal sequence 
of blocks within a single session and 
the different mean payoffs of blocks. 

Ratings.—Table 3 shows the means 
of Ss’ ratings of the means of the 
known and unknown payoff decks for 
each combination and incentive. The 
means for the known payoff deck were 
closer to the true mean of zero and 


TABLE 2

ANALYSIS OF VARIANCE OF ARC-SINE TRANSFORMS OF THE PROPORTION
OF GAMBLING RESPONSES IN EACH TRIAL BLOCK


Source r ws 
Incentive (I 
Ss/I 
Value (V) (known 


710.00 
2,047.71 


a 
— 


825.00, 33. 
,361.40 
454.93 
987.23 
225.60 
1§7.01 
367.75 
2,910.50, 
56.80 
292.70 

,606.10) 13.66" 
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RISK TAKING 
TABLE 3

MEANS OF Ss’ RATINGS OF DECK MEANS


Payoff 


Range Known Unknown 


O¢ O¢ 


N 33 ° 44 
M 33 .00 
W 44 22 .88 


were less variable than the means for the unknown payoff decks. Neither set of means
varied systematically with incentive or range of unknown payoff. Nor were there any
systematic relationships between these deck means and either overall proportions of
risks or proportions of risks for the last 25 trials. Most Ss reported following a
“gambler’s fallacy”* strategy and none reported an awareness of gambling differentially
with different decks of unknown payoffs. Incentive had no differential effect.


DISCUSSION

The decision to gamble or not to gamble following known outcomes of loss or gain of
a poker chip was investigated as a function of three ranges of unknown payoff involving
the loss or gain of chips worth nothing or worth 5¢. Previous findings (Myers & Katz,
1962; Myers & Sadler, 1960) of more gambling when the alternative to gambling was the
loss of a chip than when the alternative was the gain of a chip were confirmed. This
difference was reduced by monetary incentive, although the interaction was not
statistically significant: trials when the alternative was −1 were followed by fewer
choices to gamble for chips worth 5¢ than for those worth nothing, while trials where
the alternative was +1 were followed by more choices to gamble for chips worth 5¢ than
for those worth nothing. Thus, there is some suggestion that monetary payoff and range
are functionally equivalent; for all three ranges monetary incentive yielded an
increase in gambling on +1 trials, a decrease on −1 trials.

* Due to the procedure for constructing the payoff sequences, prediction of the
alternative event following a run is not really fallacious,

Other experiments have shown the form of value and range interaction obtained for
the 5¢ incentive group. Myers and Sadler, using three ranges up to and including the
range of Deck M of the present study, found that gambling consistently increased as
range increased when the alternative was a gain of one chip, but gambling decreased as
range increased when the alternative was a one-chip loss. Myers and Katz (1962)
obtained similar results through a range of Deck M. Suydam and Myers (1962), using a
very different procedure, found this convergence of positive and negative value curves
over range, for several values. It may be assumed that the gamble represents an
approach-avoidance conflict, both tendencies increasing as range does. Within this
frame of reference, the data suggest that, as range increases, the avoidance gradient
rises more swiftly against negative alternatives; the approach gradient rises more
swiftly against positive alternatives.


SUMMARY

The effects on gambling behavior of monetary incentive, range of payoffs for
gambling, and value of a payoff which could be taken in lieu of gambling were
determined. One group of 18 Ss gambled for chips only and another group of 18 Ss
gambled for chips worth 5¢ each. Three ranges of unknown payoffs were used, one at
each of three sessions. The known payoff, the acceptance of which was the alternative
to gambling, remained constant.

Neither incentive nor range had a significant effect upon the total number of risks
taken. The Ss gambled significantly more when the alternative to gambling was a loss
(−1) than when it was a gain (+1). Several interactions were significant, including
Range X Value and Range X Value X Incentive. These led to the conclusion that gambling
is affected differentially on +1 and −1 trials by the range of chips to be gained or
lost, and by the interaction of range and monetary incentive.

LEONARD KATZ

SPATIAL S-R CONTIGUITY IN HUMAN DISCRIMINATION LEARNING

D. STANDISH AND R. A. CHAMPION

University of Sydney

Though differing in purpose, both of the experiments to be reported are basically
similar in design to an earlier study (Champion & Standish, 1960) in which preliminary
training was given with two pairs of stimuli in fixed spatial relations (spatial
discrimination learning) followed by a test stage involving the two pairs and their
transposes (nonspatial learning). Under these conditions negative transfer from
training to test occurred when between-pair differences among the stimuli were greater
than within-pair differences, but positive transfer occurred when within-pair
differences dominated. These findings were interpreted in terms of the type of S-R
theory proposed by Spence (1960); in particular, the differential transfer was
explained through the presence or absence of within-pair discriminations on the part
of S in the training period, it being argued that the occurrence of these
discriminations promoted positive transfer while their absence caused interference.
The basis of the present studies rested in the further assumption that spatial S-R
contiguity would also promote within-pair discriminations, and it was predicted that
groups trained and tested under such conditions would show positive transfer, even
though between-pair differences were dominant.

The first experiment consisted of a repetition of the 1960 study with the exception
that S was required to respond by pressing a button adjacent to the stimulus rather
than one some distance from it. In order to make a severe test of the effect of this
form of S-R contiguity, the stimuli in each pair were presented close together in
space (S-S proximity), but this allowed the possibility that the result of the
experiment was due to proximity rather than contiguity, the two factors being
confounded. The second experiment was therefore conducted in an attempt to separate
these two factors and attention was concentrated on the contiguity variable with
degree of proximity held constant.



EXPERIMENT I

Method

Subjects.—The Ss were 44 undergraduates from courses in psychology at the University
of Sydney, there being 13 Ss in Groups 1 and 2 and 9 in Groups 3 and 4. Color-blind
students were excluded.

Apparatus and stimuli.—The apparatus was identical with that used previously
(Champion & Standish, 1960), allowing S to be presented with colored circles of light
1 in. in diameter, except that the lights were only [illegible] in. apart in the
horizontal midline of the [illegible] X 2 in. milk-glass screen, and the response
buttons were mounted as close to the stimuli as possible, one button being located
immediately above each light. In an attempt to equate within-pair differences the
stimulus settings were adjusted to the following values of red (R) and green (G) on
the Munsell system, with brightness and saturation held approximately constant:
Stimulus R₁ = 5R, R₂ = 5YR, G₁ = 7.5GY, G₂ = 2.5GY. The results of the 1960 study had
already been confirmed with these settings (Standish, 1960).

Procedure.—The instructions to S were slightly modified to allow for the new location
of the buttons, and between responses S’s response hand rested on the table immediately
below the vertical midline of the screen, but otherwise the procedure was identical
with that reported earlier, the control groups being denied preliminary training
TABLE 1

STIMULUS PAIRS FOR Ss IN EXP. I

            Experimental Groups          Control Groups
Stage       1 (b > w)     3 (w > b)      2 (b > w)     4 (w > b)

Training    R₁+:R₂−       G₁+:R₂−
            G₂−:G₁+       G₂−:R₁+

Test        R₁+:R₂−       G₁+:R₂−        R₁+:R₂−       G₁+:R₂−
            G₂−:G₁+       G₂−:R₁+        G₂−:G₁+       G₂−:R₁+
            R₂−:R₁+       R₂−:G₁+        R₂−:R₁+       R₂−:G₁+
            G₁+:G₂−       R₁+:G₂−        G₁+:G₂−       R₁+:G₂−

Note.—The spatial relations of the symbols in each pair correspond to the actual
left-right arrangements of the stimuli. The designations b > w etc. refer to relative
magnitudes of stimulus differences between and within pairs.



so as to allow an assessment of the nature and amount of transfer. A summary of the
procedure and stimulus pairs is given in Table 1.


Results

The results, in terms of mean trials to the criterion of eight successive correct
trials, are summarized in Table 2. Because of the presence of some extreme scores,
nonparametric statistical tests were used. The application of a U test to the data of
the first stage for Groups 1 and 3 (two stimulus pairs in fixed spatial relations)
confirmed the earlier finding of superior performance with between-pair differences
dominant (U = 6 with n₁ = 13 and n₂ = 9, P < .001).
TABLE 2

TRIALS TO CRITERION IN EXP. I

            Between-Pair Differences     Within-Pair Differences
            Dominant                     Dominant
Stage       Group 1      Group 2         Group 3      Group 4
            (Exp.)       (Control)       (Exp.)       (Control)

Training
  Mean      14.4                         25.9
  SD         3.6                          9.4
Test
  Mean      19.0         32.8            16.3         33.6
  SD        22.8          9.4             4.8         19.8

The use of a factorial median test (Sutcliffe, 1957) with the data of the test stage
(two stimulus pairs and their transposes) failed to reveal any differential transfer,
there being no significant interaction between the experimental-control variable and
type of stimulus difference (χ² = 3.38 for 1 df). Subsequent U tests showed that
significant positive transfer occurred with both between-pair and within-pair
differences dominant (U = 38.5 with n₁ and n₂ = 13, P < .02, and U = 11.5 with n₁ and
n₂ = 9, P < .02, respectively).
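For readers unfamiliar with the U statistic reported above, a minimal sketch of how it is counted follows. The individual trials-to-criterion scores are not given in the text, so the two samples below are hypothetical, and the function name is illustrative rather than taken from any library.

```python
def mann_whitney_u(x, y):
    """Return the smaller of the two U counts, scoring tied pairs as one half."""
    u_x = 0.0
    for a in x:
        for b in y:
            if a > b:
                u_x += 1.0   # a outranks b
            elif a == b:
                u_x += 0.5   # tie shared between the samples
    u_y = len(x) * len(y) - u_x
    return min(u_x, u_y)


# Hypothetical trials-to-criterion scores, NOT the data of Exp. I.
experimental = [10, 12, 15]
control = [14, 20, 25, 30]
u = mann_whitney_u(experimental, control)
```

A small U means the two samples barely overlap; significance is then read from tables (or an approximation) for the given n₁ and n₂.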


EXPERIMENT II
Method


Subjects.— The Ss were 60 undergraduates 
from courses in psychology, divided into four 
groups of 15. Color-blind students were not 
excluded. 

Apparatus.—The apparatus consisted of a sheet of building board 72 in. wide and 36 in.
high, mounted vertically on a table in front of S and containing two 2} X 2} in.
pearl-perspex squares 4 in. apart in the horizontal midline. An attempt was made to
locate S equidistant from the two squares so that the centers subtended the same
visual angle as did the centers of the circles in the 1960 study. Housed behind the
perspex squares were 15-w. lamps whose intensity was controlled by two variacs. The
duration of illumination of the stimuli on any trial was 200 msec. In order to achieve
spatial contiguity of stimulus and response S was instructed to use response buttons
mounted on each perspex square, one in each top corner nearer to the vertical midline
of the building board. To obtain noncontiguity conditions, S was told to use two
buttons mounted side by side in a small metal box located 3 in. in front of the board,
the box being centered 16 in. to the right of the midline of the board. The response
buttons activated indicator lamps as a signal to E.

Stimuli.—The chief basis of discrimination in Exp. II was intensity of illumination,
and the four values of the stimuli in foot-Lambert units were as follows: B₁ = 125,
B₂ = 100, D₁ = 2.5, D₂ = 3.2. These illumination intensities were measured with a
photometer at the usual location of S’s head. Lights B₁ and B₂ were similar bright
stimuli, whereas D₁ and D₂ were similar dull stimuli. The change from bright to dull
illumination of the filament lamps was accompanied by the usual change in hue.

Procedure.—The instructions to S were again modified to the extent demanded by the
changes in the apparatus and the conditions of training. The independent variable was
degree of spatial S-R contiguity (buttons near to or far from lights) and the stimulus
conditions were limited to dominance of between-pair differences, the required
discriminations being between B₁ and B₂, and between D₁ and D₂. Otherwise the procedure
was as before, with training and test stages for the two experimental groups (Group 1,
contiguity, and Group 3, noncontiguity), and test stages alone for the corresponding
control groups (Groups 2 and 4, respectively). The criterion of learning was increased
to 12 successive correct trials.


Results

The trials-to-criterion scores for the training and test stages are presented in
Table 3. For these data it was possible to use a parametric statistic, and the
application of a t test to the training scores of Groups 1 and 3 revealed no
significant difference. An analysis of the variance for the test data of the second
stage showed that while the pretraining had no consistent effect (experimental vs.
control groups), the influence of spatial S-R contiguity and degree of differential
transfer (interaction) were both statistically significant. Follow-up t tests showed
that the differences between Groups 1 and 2, and Groups 1 and 3 were significant
(t = 2.56, P < .02, and t = 4.74, P < .01, respectively, for 28 df) but that the
differences between Groups 2 and 4, and Groups 3 and 4 were not significant. It thus
appears that positive transfer occurred only under contiguity conditions, but that
there was no negative transfer with noncontiguity of S and R.

TABLE 3

TRIALS TO CRITERION IN EXP. II

Columns: Contiguity, Group 1 (Exp.) and Group 2 (Control); Noncontiguity, Group 3
(Exp.) and Group 4 (Control).

Training    Mean 25.6    SD 20.6
Test        Mean 18.9    SD  8.9

(Only one entry per row is legible in the source, and its column cannot be identified.)
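The 28 df quoted for the follow-up t tests is what two independent groups of 15 Ss give (15 + 15 − 2). A minimal sketch of a pooled-variance t computed from group summaries follows; the text does not say which form of t test was used, so the pooled form is an assumption, and the means and SDs below are hypothetical, not values from Table 3.

```python
import math


def pooled_t(mean1, sd1, n1, mean2, sd2, n2):
    """Two-sample t with pooled variance; returns (t, degrees of freedom)."""
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df
    std_err = math.sqrt(pooled_var * (1.0 / n1 + 1.0 / n2))
    return (mean1 - mean2) / std_err, df


# Hypothetical group summaries for two groups of 15 Ss each.
t_value, df = pooled_t(20.0, 5.0, 15, 15.0, 5.0, 15)
```

With equal group sizes the pooled variance is simply the average of the two group variances.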


DISCUSSION

The results of the two experiments lead to the conclusion that spatial S-R
contiguity, like between-pair similarity, leads to positive transfer from the spatial
to the nonspatial learning situation (training to test), and if the theoretical
interpretation given in terms of S-R theory is correct, then this is because the
contiguity promotes more effective within-pair discrimination in the first stage. The
chief finding of Exp. I was that positive transfer from training to test may be
obtained even when between-pair differences are dominant, provided that S be required
literally to approach the positive stimulus cue in the course of the response. This
result is to be contrasted with that of the 1960 study, where the dominance of
between-pair differences produced negative transfer. That the present outcome cannot
be attributed to the greater proximity of the stimulus cues in Exp. I is demonstrated
by the similar result in Exp. II with a return to the same degree of nonproximity as
obtained in the 1960 study.

The data of Exp. II suggest that the direct effects of contiguity may not be as
powerful with adult humans as with children and lower animals, by comparison with the
marked effects obtained by Murphy and Miller (1958, 1959). Learning was more efficient
under contiguity conditions both in the training of Groups 1 and 3 and in the test
periods of Groups 2 and 4 (Table 3), but in neither case was the difference
statistically significant. However, when the two scores for each S in the experimental
groups of Exp. II (Groups 1 and 3) on the training and test stages were combined, so
as to make the two tasks one, a significant difference emerged (t = 4.32 for 28 df,
P < .01), possibly due to the greater stability of the scores. It will also be noted
that the significant negative transfer obtained in the 1960 study with noncontiguity
and between-pair differences dominant was not duplicated in Groups 3 and 4 of Exp. II,
although the result was in the same direction.

Spence’s recent elaboration of an S-R theory of selective learning (Spence, 1960)
points to some of the complexities to be coped with in any detailed formulation, but
brief consideration may profitably be given to the broader theoretical significance of
spatial S-R contiguity. In the present context this variable has been assumed to act
through within-pair discrimination, and in S-R terms the efficiency of this latter
discrimination depends in turn upon the relative strengths of the orienting and
approach tendencies to the positive and negative discriminanda. The necessary link
between spatial contiguity and the relative strengths of the correct and incorrect
responses may now be provided if it be allowed that spatial contiguity amounts to
temporal contiguity of stimulus and response, for the importance of the latter
variable in simple learning seems undoubted (e.g., Champion, 1962). The equation of
spatial and temporal contiguity in the discrimination-learning situation follows from
the fact that when S is required to respond by pressing a button in or near the
positive stimulus (contiguity) then there is more likely to be a short time interval
between exposure to the stimulus and the occurrence of the response than if S has to
look away from the stimulus to locate the appropriate response button (noncontiguity).
This effect might show up more clearly if the stimuli were presented for a long time
interval, but the argument applies even with intervals as short as 100 msec., for
there is no requirement that stimulus and response overlap in time, the terms
“contiguity” and “noncontiguity” being relative rather than absolute.


SUMMARY

Two experiments on discrimination learning were conducted under conditions in which
preliminary training was given with two pairs of stimuli in fixed spatial relations,
followed by test learning involving the two pairs and their transposes. The aim was to
test the effects of spatial S-R contiguity, for it was predicted that contiguity would
cause positive transfer from training to test. The prediction was confirmed, and the
result was interpreted in S-R terms with the hypothesis that contiguity, like
between-pair similarity, promotes within-pair discriminations on the part of the
learner, it being supposed that spatial contiguity has this effect through the more
basic variable of temporal contiguity.
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LZ 


When Ss are familiarized with the stimulus units but not the response units of a list
prior to paired-associate (PA) learning, it has generally been found that stimulus
familiarization (SF) has a slight inhibitory effect or no effect at all on PA
performance (Underwood & Schulz, 1960). Recently Gannon and Noble (1961) have reported
SF facilitation of performance [illegible] dissyllables following 20 trials [illegible]


the latter result is the one to be 


lrequency ol prior experience is 


through which stimulus mea 


its eflect on PA periormance, it 1s 
with considerable theoretical significa 


Cieutat, Stor kwell, & Noble 1958: 
vood & Schulz, 1960 However, the 


etiect un SF is also co spi 


Ol a positive 


ously inconsistent with results of previo 
* 


tudies Therefore seemed especially 


portant to further assess the reliability of th 


result and consider potential iter t 
explanatio lor it 
One such alternat 
Noble’s procedure of havi 
h 


stimulus unit during t 


when combined 


amount of SF, may have u 


nterval, 


duced simultaneous variatio 
le eth the anticipatior 

practice in articulation of stimulus 
amount of SF, f | 


directly related to imiliarizec 


Ss might ) 


spend relatively less of the 2-sec. anticipation interval for stimulus articulation
than nonfamiliarized Ss. On the basis of the presumed direct relationship between PA
performance and length of the anticipation interval, Gannon and Noble’s results would
then be expected. The present experiment tested this hypothesis by comparing the
performance of Ss instructed to pronounce the stimulus units during PA anticipation
with performance of Ss instructed not to pronounce. A significant interaction of PA
instruction and amount of SF will be required to support the present contention.
SUPPLEMENTARY REPORT: STIMULUS FAMILIARIZATION IN PAIRED-ASSOCIATE LEARNING

IRVING I

Method.—A 2 X 3 factorial design of the 2 levels of PA instruction, articulation (A)
vs. nonarticulation (NA) of stimulus units, and 3 amounts of SF (0, 20, and 60 trials)
was used. The six respective conditions will be referred to in terms of the values of
the independent variable associated with them (e.g., Cond. A₀, articulation
instructions and 0 familiarization, Cond. NA₆₀, nonarticulation instructions and 60
trials of familiarization, etc.).

The materials and procedures were identical to those used by Gannon and Noble (1961)
with the following exceptions: (a) A [illegible] rate of presentation was used during
familiarization; (b) PA performance consisted of 17 anticipation trials.

A total of 144 Ss, 24 per condition, taking introductory psychology at the University
of Iowa were randomly assigned to conditions as they appeared at the laboratory. The
Ss had not served in prior verbal learning experiments.
FIG. 1. Mean total number of correct responses during 17 anticipation trials as a
function of PA instructions and number of stimulus familiarization trials. The
standard error of the means in Fig. 1, from the within-groups MS of the overall
analysis of variance, was 2.71. (Ordinate: mean number of correct responses; abscissa:
amount of familiarization; solid line: pronounce; dashed line: do not pronounce.)

Results and Discussion.—Performance on the PA list under the six conditions, in terms
of mean total number of correct responses during the 17 anticipation trials, is shown
in Fig. 1. The predicted interaction between PA instructions and amount of SF was
obtained and found by analysis of variance to be significant (F = 3.78, df
[illegible]). Consistent with the results of Gannon and Noble (1961), performance was
a monotonic increasing function of number of familiarization trials when Ss were
required to pronounce the stimulus terms of a PA list prior to anticipation of the
[illegible]
nonarticulation PA instructions, performance was inversely related to amount of
familiarization. It can also be seen from Fig. 1 that, even though the facilitating
effects of familiarization appear to be approaching an asymptotic level under
Cond. A₆₀, performance under Cond. NA₀ was slightly better than under Cond. A₆₀.
Irrespective of how proficient S becomes at pronouncing the stimulus units, it takes
longer to pronounce than not to pronounce; it takes longer to say something than to
say nothing.

Intercomparison of the various conditions via the critical difference technique
(Lindquist, 1953) revealed two significant (P < .05) differences, Cond. NA₀ vs.
Cond. A₀ and Cond. NA₀ vs. Cond. NA₆₀. Inspection of acquisition as a function of
trials for each of the conditions did not reveal evidence of interaction between
treatments and trials.

In conclusion, it is apparent that PA instructions regarding S’s response to the
stimulus term during the anticipation interval can determine whether SF facilitates or
inhibits PA performance. We believe that these effects are attributable to covariation
of the effective length of the anticipation interval with amount of familiarization.
Furthermore, it may be expected that such factors as word length, pronunciability, and
meaningfulness will also, depending on the length of the anticipation interval,
interact with PA instructions and amount of familiarization.

Journal of Experimental Psychology
1962, Vol. 64, No. 5, 550-551

SUPPLEMENTARY REPORT: TIME BETWEEN PAIRINGS AND SHORT-TERM RETENTION¹

LLOYD R. PETERSON, KENNETH HILLNER, AND DOROTHY SALTZMAN

Indiana University


Peterson, Saltzman, Hillner, and Land (1962) found marked forgetting within 8 sec.
after a single paired-associate presentation, when other presentations filled the
interval. The present study investigates the effect of an 8-sec. interval similarly
filled which is inserted between the first and second presentations of an individual
pair later tested for retention.

Method.—The technique of the previous experiment was used. The first as well as the
second pairing consisted of S reading aloud the stimulus and the response from the
drum. Either 0 or 8 sec. separated the two pairings. The retention interval, measured
from removal of the second presentation from the drum, was 2, 4, 8, or 16 sec. There
were 42 Ss who were tested 16 times in each of the eight conditions. Stimuli were
three- and four-letter words. The responses were indicated in the instructions to be
the numbers 1-10. Twelve seconds rest separated 16 blocks of 31-41 exposures, save for
a 1-min. rest between Blocks 8 and 9.

¹ This research was supported by Grant G 12917 from the National Science Foundation to
Indiana University, a grant for which the senior author is principal investigator.

Results.—Table 1 shows that massed pairings resulted in superior recall at the 2- and
4-sec. retention intervals, while 8-sec. spacing was superior at the 8- and 16-sec.
retention intervals. An analysis of variance found significance for both Spacing
(F = 10.72, df = 1/41, P < .01) and Retention (F = 71.80,


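TABLE 1

PROPORTIONS CORRECTLY RECALLED

Spacing        Retention Interval (Sec.)
Interval       2              4      8      16

0 Sec.         [illegible]    .69    .45    .48
8 Sec.         .60            .66    .56    .56

(The original table gives separate entries for the first and second halves of the
session; only one set is legible in the source, and its half cannot be identified.)
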


df = 3/123, P < .01). Their interaction was also significant (F = 16.47, df = 3/123,
P < .01). The two halves of the session did not differ significantly (F = .73,
df = 1/41, P > .05). After pooling data from the two halves of the session, individual
t tests showed that differences between the two spacing conditions were significant at
the 4-, 8-, and 16-sec. retention intervals (t = 3.07, 5.23, 4.84; df = 41, P < .01).

A paradox is presented by the finding that when an interval during which marked
forgetting can be shown to occur is introduced between pairings, there is improvement
in retention at long intervals. Underwood’s
(1961) explanation of distributed practice of lists does not seem appropriate here,
since he concludes that distribution is superior to massing only when response
learning is involved. Response learning was minimized by the instructions in the
present study of spacing between individual pairings.

SUPPLEMENTARY REPORT: YOKED COMPARISONS OF CLASSICAL AND AVOIDANCE EYELID
CONDITIONING UNDER THREE UCS INTENSITIES¹

I. GORMEZANO, JOHN W. MOORE, AND EDWARD DEAUX

Moore and Gormezano (1961) observed that classical conditioning, when experimentally
equated in terms of partial reinforcement pattern and number of UCS occurrences by a
yoking procedure, was inferior to avoidance conditioning. The present investigation
was conducted to determine the effects of UCS intensity on such yoked comparisons of
classical and avoidance conditioning.

Method.—The general apparatus, procedure, and stimuli were the same as in the earlier
experiment. The only variation was the intensities of the UCS employed. Twenty Ss were
assigned to each of the six cells of a 2 X 3 factorial design in which classical and
avoidance conditions were made orthogonal to three UCS puffs of nitrogen which had
intensities sufficient to support 40-, 80-, and 160-mm. columns of mercury. In
addition 2 male Ss were lost because of apparatus failure. The two recording systems
and sex were also made orthogonal to the classical-avoidance and UCS intensity
dimensions.

Results.—The distributions of response latencies for all six groups in acquisition and
extinction were recorded. The distributions (not shown) revealed that aside from the
higher frequency of responses in the CR range for the avoidance groups, the only
discernable difference was the tendency for the modal responses to decrease in
latency, under both conditioning procedures, as UCS intensity increased.

¹ This research was supported by Grant G 16030 from the National Science Foundation.
A report of this experiment was presented at the Psychonomic Society, New York,
September 1961.


Figure 1 presents the results of plotting mean percentage CRs for the six experimental groups in acquisition and extinction. The initial points on all acquisition curves are the percentage CRs on Trial 1 and the remaining points are for successive blocks of 10 trials. The extinction curves are plotted in 5-trial blocks. The figure indicates that the acquisition performance of the avoidance groups (A160, A80, A40), for each of the three UCS intensities, was superior to each of their respective yoked-classical groups (i.e., A160 vs. Y160, A80 vs. Y80, and A40 vs. Y40). Though Groups A160 and A80 follow essentially identical courses of acquisition, Group Y160 was inferior to Y80. The extinction curves reveal that the higher level of responding of the avoidance groups in acquisition also persisted in extinction. As was observed in the previous study, CRs of the avoidance groups demonstrated steeper decay functions than the yoked-classical groups and are probably in large part due to their having started at higher levels of responding.
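The way the plotted points are formed can be illustrated with a minimal sketch. It computes percentage CRs for Trial 1, for successive 10-trial acquisition blocks, and for 5-trial extinction blocks from one subject's trial-by-trial record. The function names and the 0/1 response records are hypothetical, and the sketch assumes that the 10-trial blocks begin at Trial 1, which the report does not state.

```python
from typing import List, Sequence


def block_percentages(responses: Sequence[int], block_size: int) -> List[float]:
    """Percentage of CRs (1 = CR, 0 = no CR) in successive blocks of trials."""
    if block_size <= 0:
        raise ValueError("block_size must be positive")
    percentages = []
    for start in range(0, len(responses), block_size):
        block = responses[start:start + block_size]
        percentages.append(100.0 * sum(block) / len(block))
    return percentages


def acquisition_points(responses: Sequence[int]) -> List[float]:
    """Trial 1 as the initial point, then successive 10-trial blocks.

    Assumption: blocks cover trials 1-10, 11-20, and so on.
    """
    return [100.0 * responses[0]] + block_percentages(responses, 10)


# Hypothetical record for one subject: no CRs on trials 1-10, then a CR on
# every second trial for the remaining 60 of the 70 acquisition trials.
acquisition = [0] * 10 + [0, 1] * 30
# Hypothetical extinction record: CRs on the first 5 of the 20 trials only.
extinction = [1] * 5 + [0] * 15

print(acquisition_points(acquisition))
print(block_percentages(extinction, 5))
```

A group curve would then be the mean of these per-subject values at each point.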
[Figure 1. Per cent CRs in acquisition and extinction; curve labels include A 160 and A 80.]

Split-plot analyses of variance (Snedecor, 1959) on the arc-sine transform of individual percentage CRs for the 70 acquisition and 20 extinction trials revealed significant F values for the classical vs. avoidance comparisons in acquisition (F = 27.15, df = 1/48, P < .005) and extinction (F = 22.55, df = 1/48, P < .005). The UCS intensity dimension failed to produce significant differences in either acquisition or extinction. A significant F value was obtained in acquisition for the Classical-Avoidance X UCS Intensity interaction (F = 14.39, df = 2/48, P < .005), reflecting the fact that as UCS intensity increased, performance of the avoidance groups increased while performance of the yoked-classical groups decreased. This interaction appears to be simply a function of a negative correlation between the performance levels of the paired Ss. However, the significant interaction is heavily weighted by the low level of responding of Group Y160. If the perform ... performance relative to Y80 suggests a Partial Reinforcement X UCS Intensity interaction.

A significant F value was also obtained for the Classical-Avoidance X Sex interaction in extinction (F = 5.02, df = 1/48, P < .05); and examination of the mean percentage CRs revealed that under the avoidance procedure males gave a higher percentage of CRs whereas, under the yoked-classical procedure the females were superior. In the remaining sources of variation those of statistical significance were without psychological import (i.e., interactions involving the recording system source of variation).

Moore, J. W., & Gormezano, I. Yoked comparisons of instrumental and classical eyelid conditioning.
Snedecor, G. W. Statistical methods. Ames, Iowa: Iowa State Coll. Press, 1959.
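The arc-sine transform applied to individual percentage CRs before the analyses of variance can be sketched as follows. The report does not give the exact form used, so this sketch assumes the common angular transform, the arc sine of the square root of the proportion, expressed in degrees. The function name `arcsine_transform` and the sample percentages are hypothetical.

```python
import math


def arcsine_transform(percent: float) -> float:
    """Angular transform: arcsin(sqrt(p)) in degrees, with p = percent / 100."""
    if not 0.0 <= percent <= 100.0:
        raise ValueError("percent must lie between 0 and 100")
    return math.degrees(math.asin(math.sqrt(percent / 100.0)))


# Hypothetical individual percentage CRs, for illustration only.
for pct in (0.0, 25.0, 50.0, 100.0):
    print(pct, round(arcsine_transform(pct), 2))
```

The transform stretches the scale near 0 and 100 per cent, where raw percentages have smaller variance than in the middle of the range.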
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